UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios • Paper 2509.21766 • Published Sep 26
D-Artemis: A Deliberative Cognitive Framework for Mobile GUI Multi-Agents • Paper 2509.21799 • Published Sep 26