Running 2.4k 2.4k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-Coder-V2-Instruct Text Generation • Updated Aug 21, 2024 • 5.82k • • 606