HuggingFaceTB/SmolVLM2-500M-Video-Instruct Video-Text-to-Text • Updated about 8 hours ago • 1.92k • 31
HuggingFaceTB/SmolVLM2-256M-Video-Instruct Video-Text-to-Text • Updated about 8 hours ago • 1.95k • 26
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 23 days ago • 109
ibm-granite/granite-vision-3.1-2b-preview Image-Text-to-Text • Updated about 5 hours ago • 12.3k • 84