Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhang199 's Collections
TinyLLaVA-Video-R1
TinyLLaVA-Video
TinyLLaVA

TinyLLaVA-Video

updated Apr 14

A Simple Framework of Small-scale LMMs for Video Understanding.

Upvote
-

  • Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-16-512

    Video-Text-to-Text • 4B • Updated 19 days ago • 29

  • Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-1fps-512

    Video-Text-to-Text • 4B • Updated Apr 24 • 16

  • Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Naive-16-512

    Video-Text-to-Text • 4B • Updated 19 days ago • 23

  • Zhang199/TinyLLaVA-Video-Phi2-Naive-16-512

    Video-Text-to-Text • 3B • Updated 19 days ago • 37

  • Zhang199/TinyLLaVA-Video-v1-training-data

    Updated 17 days ago • 117

  • TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding

    Paper • 2501.15513 • Published Jan 26
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs