Predictive Data Selection: The Data That Predicts Is the Data That Teaches Paper • 2503.00808 • Published 8 days ago • 52
Predictive Data Selection: The Data That Predicts Is the Data That Teaches Paper • 2503.00808 • Published 8 days ago • 52
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published 18 days ago • 177
Running 2.15k 2.15k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
SimpleRL Collection The collection for the Project "Simple Reinforcement Learning for Reasoning" • 2 items • Updated 19 days ago • 5
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published 22 days ago • 141
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13 • 92