Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation Paper • 2506.21876 • Published 4 days ago • 9
Breaking the Data Barrier -- Building GUI Agents Through Task Generalization Paper • 2504.10127 • Published Apr 14 • 17
Beyond the Binary: Capturing Diverse Preferences With Reward Regularization Paper • 2412.03822 • Published Dec 5, 2024
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving Paper • 2502.20238 • Published Feb 27 • 24
AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind Paper • 2502.15676 • Published Feb 21 • 3
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published Dec 27, 2024 • 88