view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 997
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13 • 99