LLaVA-Video - a lmms-lab Collection

lmms-lab 's Collections

EgoLife

LLaVA-OneVision

LongVA

LLaVA-Next-Interleave

LLaVA-Video

updated Feb 21

Models focus on video understanding (previously known as LLaVA-NeXT-Video).

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3, 2024 • 40
lmms-lab/LLaVA-Video-178K

Viewer • Updated Oct 11, 2024 • 1.63M • 41.3k • 157
lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 33.7k • 105
lmms-lab/LLaVA-Video-72B-Qwen2

Text Generation • 73B • Updated Oct 25, 2024 • 710 • 20
lmms-lab/LLaVA-Video-7B-Qwen2-Video-Only

Text Generation • 8B • Updated Oct 4, 2024 • 428 • 4
lmms-lab/LLaVA-NeXT-Video-32B-Qwen

Video-Text-to-Text • 33B • Updated Oct 4, 2024 • 443 • 15
lmms-lab/LLaVA-NeXT-Video-7B-DPO

Video-Text-to-Text • 7B • Updated Feb 21 • 28.1k • 28
lmms-lab/LLaVA-NeXT-Video-7B

Video-Text-to-Text • 7B • Updated Feb 21 • 180 • 48