view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 28 days ago • 606
Running 2.94k 2.94k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning Paper • 2505.16933 • Published May 22 • 33