Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sy1998 's Collections
EarthMind
Video-XL

Video-XL

updated May 20
Upvote
2

  • sy1998/Video_XL

    Updated Oct 25, 2024 • 17

  • sy1998/Video_XL_Training

    Preview • Updated Dec 22, 2024 • 1.25k • 6

  • MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding

    Paper • 2406.04264 • Published Jun 6, 2024 • 2

  • Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding

    Paper • 2409.14485 • Published Sep 22, 2024 • 2

  • Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding

    Paper • 2503.18478 • Published Mar 24 • 1
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs