LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark • Paper • 2504.13805 • Published 4 days ago • 6
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency • Paper • 2502.09621 • Published Feb 13 • 28