Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

VLM2Vec

community
https://github.com/TIGER-AI-Lab/VLM2Vec
Activity Feed

AI & ML interests

Multimodal Embeddings and Retrieval.

Recent Activity

memray  updated a dataset about 24 hours ago
VLM2Vec/mvbench
MINGYISU  authored a paper 19 days ago
Breaking the Batch Barrier (B3) of Contrastive Learning via Smart Batch Mining
MINGYISU  authored a paper 19 days ago
VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents
View all activity

Xuan "Billy" Zhang's profile picture Rui's profile picture Ziyan Jiang's profile picture Xinyi Yang's profile picture Liu's profile picture MINGYI SU's profile picture

models 1

VLM2Vec/VLM2Vec-V2.0

Image-to-Text • Updated 17 days ago • 5.31k • 8

datasets 26

VLM2Vec/mvbench

Viewer • Updated about 24 hours ago • 537 • 90

VLM2Vec/nextqa

Viewer • Updated 1 day ago • 1.56k • 78

VLM2Vec/videomme

Viewer • Updated 1 day ago • 898 • 134

VLM2Vec/egoschema

Viewer • Updated 1 day ago • 549 • 90

VLM2Vec/ActivityNetQA

Viewer • Updated 1 day ago • 550 • 94

VLM2Vec/MomentSeeker

Viewer • Updated Jun 26 • 1.8k • 78

VLM2Vec/Charades-STA

Viewer • Updated Jun 26 • 727 • 60

VLM2Vec/QVHighlight

Viewer • Updated Jun 26 • 1.08k • 222

VLM2Vec/MMEB-V2

Updated Jun 13 • 225

VLM2Vec/Kinetics-700

Viewer • Updated May 31 • 1k • 302
View 26 datasets
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs