deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation β’ Updated 3 days ago β’ 1.17M β’ β’ 1.19k
Running 1.65k 1.65k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
mistralai/Mistral-Small-24B-Instruct-2501 Text Generation β’ Updated 24 days ago β’ 764k β’ β’ 823