Running 1.79k 1.79k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Running 526 526 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
Zebra: In-Context and Generative Pretraining for Solving Parametric PDEs Paper • 2410.03437 • Published Oct 4, 2024 • 2