Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation β’ 480B β’ Updated 10 days ago β’ 25.7k β’ β’ 991
Running 2.88k 2.88k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
meta-llama/Llama-3.3-70B-Instruct Text Generation β’ 71B β’ Updated Dec 21, 2024 β’ 391k β’ β’ 2.46k