OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling Paper • 2506.20512 • Published Jun 25 • 46 • 1
NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning Paper • 2504.13941 • Published Apr 15 • 11 • 4
OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far? Paper • 2406.16772 • Published Jun 24, 2024 • 2 • 2
Benchmarking Benchmark Leakage in Large Language Models Paper • 2404.18824 • Published Apr 29, 2024 • 6 • 1