LLaVA-Video
Updated Feb 21
Models focused on video understanding (previously known as LLaVA-NeXT-Video).
Video Instruction Tuning With Synthetic Data
Paper • 2410.02713 • Published Oct 3, 2024 • 39
lmms-lab/LLaVA-Video-178K
Viewer • Updated Oct 11, 2024 • 1.63M • 25.2k • 151
lmms-lab/LLaVA-Video-7B-Qwen2
Video-Text-to-Text • Updated Oct 25, 2024 • 258k • 101
lmms-lab/LLaVA-Video-72B-Qwen2
Text Generation • Updated Oct 25, 2024 • 243 • 19
lmms-lab/LLaVA-Video-7B-Qwen2-Video-Only
Text Generation • Updated Oct 4, 2024 • 545 • 4
lmms-lab/LLaVA-NeXT-Video-32B-Qwen
Video-Text-to-Text • Updated Oct 4, 2024 • 32 • 15
lmms-lab/LLaVA-NeXT-Video-7B-DPO
Video-Text-to-Text • Updated Feb 21 • 1.17k • 28
lmms-lab/LLaVA-NeXT-Video-7B
Video-Text-to-Text • Updated Feb 21 • 1.04k • 48