Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models Paper • 2502.04404 • Published Feb 6 • 24
Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published 9 days ago • 42
THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models Paper • 2504.13367 • Published 13 days ago • 24