Qwen/Qwen3-VL-30B-A3B-Instruct Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 815k • 516
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated Dec 23, 2025 • 85
chancharikm/qwen2.5-vl-7b-cam-motion Video-Text-to-Text • 8B • Updated Sep 19, 2025 • 301 • 17