TinyLLaVA-Video - a Zhang199 Collection

Zhang199 's Collections

TinyLLaVA-Video-R1

TinyLLaVA-Video

TinyLLaVA-Video

updated Apr 14

A Simple Framework of Small-scale LMMs for Video Understanding.

Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-16-512

Video-Text-to-Text • 4B • Updated 19 days ago • 29
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-1fps-512

Video-Text-to-Text • 4B • Updated Apr 24 • 16
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Naive-16-512

Video-Text-to-Text • 4B • Updated 19 days ago • 23
Zhang199/TinyLLaVA-Video-Phi2-Naive-16-512

Video-Text-to-Text • 3B • Updated 19 days ago • 37
Zhang199/TinyLLaVA-Video-v1-training-data

Updated 17 days ago • 117
TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding

Paper • 2501.15513 • Published Jan 26