Deping's Collections

VideoEncoder

updated Mar 8, 2024

Video Understanding, Video Embedding, Video Tasks

  • Video as the New Language for Real-World Decision Making

    Paper • 2402.17139 • Published Feb 27, 2024 • 22

  • VideoPrism: A Foundational Visual Encoder for Video Understanding

    Paper • 2402.13217 • Published Feb 20, 2024 • 25

  • World Model on Million-Length Video And Language With RingAttention

    Paper • 2402.08268 • Published Feb 13, 2024 • 39

  • microsoft/xclip-base-patch16-zero-shot

    Video Classification • Updated Sep 12, 2023 • 56.7k • 24

  • MCG-NJU/videomae-large

    Video Classification • Updated Apr 1, 2024 • 4.1k • 32