Running 2.96k 2.96k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit Text Generation • 5B • Updated 19 days ago • 38.4k • 31