Qwen/Qwen3-235B-A22B-Instruct-2507 Text Generation β’ 235B β’ Updated 6 days ago β’ 14.3k β’ β’ 517
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation β’ 15B β’ Updated 5 days ago β’ 30.3k β’ 65
meta-llama/Meta-Llama-3-8B-Instruct Text Generation β’ 8B β’ Updated Jun 18 β’ 1.37M β’ β’ 4.09k
Running 2.84k 2.84k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters