Running 2.85k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving Paper • 2401.09670 • Published Jan 18, 2024 • 2
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 322k • 1.46k
Running 257 Qwen2.5 VL 72B Instruct 💻 Interact with a multimodal chatbot using text and images
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 127
QuantFactory/Llama-3.2-Taiwan-Legal-3B-Instruct-GGUF Text Generation • 3B • Updated Nov 2, 2024 • 1.04k • 11
Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published Jan 9 • 100
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 123
Running on Zero 115 Llama3.1 S V0.2 Checkpoint 2024 08 20 😻 Convert text to audio and vice versa
shenzhi-wang/Llama3.1-8B-Chinese-Chat Text Generation • 8B • Updated Jul 29, 2024 • 4.71k • 262
bullerwins/gradientai_Llama-3-8B-Instruct-262k_exl2_8.0bpw Text Generation • Updated Apr 26, 2024 • 4 • 3