CoTA Datasets Collection This collection contains all versions of the CoTA (Chain-of-Thought-and-Action) datasets. • 5 items • Updated 5 days ago • 3
TACO Models Collection This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. • 3 items • Updated 5 days ago • 2
Salesforce/xgen-mm-vid-phi3-mini-r-v1.5-128tokens-16frames Image-Text-to-Text • Updated 7 days ago • 1 • 2
CoTA Datasets Collection This collection contains all versions of the CoTA (Chain-of-Thought-and-Action) datasets. • 5 items • Updated 5 days ago • 3
XGen-MM-1 models and datasets Collection A collection of all XGen-MM (Foundation LMM) models! • 16 items • Updated 6 days ago • 36
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published 20 days ago • 49