The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published 10 days ago • 116
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond Paper • 2505.19641 • Published 13 days ago • 64
Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning Paper • 2505.22203 • Published 11 days ago • 6
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows Paper • 2505.19897 • Published 13 days ago • 101