VLM2Vec

community

https://github.com/TIGER-AI-Lab/VLM2Vec

AI & ML interests

Multimodal Embeddings and Retrieval.

Recent Activity

magicgh authored a paper 20 days ago

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Lux1997 updated a dataset 21 days ago

VLM2Vec/MMEB-V3

Shane200023 updated a dataset 27 days ago

VLM2Vec/MMEB-V3

View all activity

Organization Card

Community About org cards

VLM2Vec & MMEB: Benchmarking multimodal embeddings and adapting state-of-the-art multimodal large language models into embedding models.

Website - https://tiger-ai-lab.github.io/VLM2Vec/
Github https://github.com/TIGER-AI-Lab/VLM2Vec

List of Our Papers

Main VLM2Vec / MMEB Series

VLM2Vec / MMEB – Image embedding benchmarking and models. (ICLR2025)
VLM2Vec-V2 / MMEB-V2 – Extension of our previous work to video and visual document tasks. (TMLR2026)

Other Related Papers from Our Team

GAE-Retriever – Benchmark and model for trajectory modeling in GUI environments. (Computer-use Agents@ICML 2025)
B3 – A novel batch mining strategy for contrastive learning. (Neurips2025)

models 1

VLM2Vec/VLM2Vec-V2.0

Image-Text-to-Text • Updated Jul 13, 2025 • 3.74k • 29

datasets 45

VLM2Vec/MMEB-V3

Preview • Updated 21 days ago • 413 • 2

VLM2Vec/GAE-Mind2Web

Viewer • Updated Feb 11 • 12.1k • 53

VLM2Vec/GAE-GUIAct

Viewer • Updated Feb 11 • 74.3k • 10

VLM2Vec/Video_Caption_HN

Viewer • Updated Dec 20, 2025 • 302k • 9

VLM2Vec/MMLongBench-page-fixed

Viewer • Updated Nov 4, 2025 • 8.91k • 1.82k

VLM2Vec/ViDoSeek-page-fixed

Viewer • Updated Nov 4, 2025 • 8.78k • 1.39k

VLM2Vec/MMEB-V2

Updated Sep 24, 2025 • 276 • 2

VLM2Vec/B3-7b

Viewer • Updated Aug 29, 2025 • 1.03M • 10 • 1

VLM2Vec/B3-2b

Viewer • Updated Aug 29, 2025 • 1.03M • 17

VLM2Vec/MVBench

Viewer • Updated Aug 15, 2025 • 4k • 2.51k

View 45 datasets