From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models (arXiv:2506.09930)
SAFE: Multitask Failure Detection for Vision-Language-Action Models (arXiv:2506.09937)
Hidden in plain sight: VLMs overlook their visual representations (arXiv:2506.08008)
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation (arXiv:2506.18088)