WideSearch: Benchmarking Agentic Broad Info-Seeking Paper • 2508.07999 • Published 18 days ago • 105
Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning Paper • 2506.23127 • Published Jun 29 • 1
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18 • 138
VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search Paper • 2504.09130 • Published Apr 12 • 12
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning Paper • 2503.10480 • Published Mar 13 • 54
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning Paper • 2503.10480 • Published Mar 13 • 54