Long-Context Autoregressive Video Modeling with Next-Frame Prediction Paper โข 2503.19325 โข Published Mar 25 โข 72
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper โข 2411.17465 โข Published Nov 26, 2024 โข 87
WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation Paper โข 2502.08047 โข Published Feb 12 โข 27