view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 8 days ago • 138
Hierarchical Budget Policy Optimization for Adaptive Reasoning Paper • 2507.15844 • Published 15 days ago • 16 • 2
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization Paper • 2507.15758 • Published 15 days ago • 34 • 1
MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task Paper • 2502.11684 • Published Feb 17 • 2
MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task Paper • 2502.11684 • Published Feb 17 • 2
SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation Paper • 2506.03139 • Published Jun 3 • 15
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization Paper • 2402.17574 • Published Feb 27, 2024
GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding Paper • 2507.15846 • Published 15 days ago • 126
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization Paper • 2507.15758 • Published 15 days ago • 34
Hierarchical Budget Policy Optimization for Adaptive Reasoning Paper • 2507.15844 • Published 15 days ago • 16
Hierarchical Budget Policy Optimization for Adaptive Reasoning Paper • 2507.15844 • Published 15 days ago • 16
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization Paper • 2507.15758 • Published 15 days ago • 34
GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding Paper • 2507.15846 • Published 15 days ago • 126 • 6