Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
MBZUAI 's Collections
NADI 2025 Sub-task 3 datasets
CASS
GeoPixel
BiMediX2
ArTST - Arabic Text Speech Transformer
VideoGPT+
GLaMM
Video-ChatGPT
LLaVA++ (LLaMA-3 and Phi-3-Mini)
PALO
MobiLlama
GeoChat
Satmae++

VideoGPT+

updated Jun 11, 2024

VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Upvote
3

  • MBZUAI/VideoGPT-plus_Phi3-mini-4k

    Updated Jun 17, 2024 • 6

  • MBZUAI/VideoGPT-plus_Phi3-mini-4k_Pretrain

    Updated Jun 17, 2024 • 1

  • MBZUAI/VCGBench-Diverse

    Updated Jul 1, 2024 • 140 • 3

  • MBZUAI/VCG-plus_112K

    Viewer • Updated Jun 17, 2024 • 139k • 91 • 6

  • MBZUAI/video_annotation_pipeline

    Viewer • Updated Jun 17, 2024 • 1 • 223 • 2

  • MBZUAI/VideoGPT-plus_Training_Dataset

    Viewer • Updated Jun 6, 2024 • 576k • 390 • 8

  • MBZUAI/VideoGPT-plus_Phi3-mini-4k_Ablations

    Updated Jun 13, 2024

  • MBZUAI/VideoGPT-plus_LLaMA3-8B-8k

    Updated Jun 13, 2024

  • MBZUAI/VideoGPT-plus_Vicuna-13B-4k

    Updated Jun 13, 2024 • 1

  • MBZUAI/VideoGPT-plus_Vicuna-7B-4k

    Updated Jun 13, 2024
Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs