Qwen/Qwen3-235B-A22B-Instruct-2507 Text Generation • 235B • Updated about 18 hours ago • 40.3k • • 605
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • 109B • Updated May 22 • 715k • • 1.04k
Running 2.98k 2.98k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters