Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
TIGER-Lab 's Collections
Pixel-Reasoner
MoCha
General-Reasoner
VL-Rethinker
Vamba
TheoremExplain
ABC
VisualWebInstruct
PixelWorld
AceCoder
CritiqueFineTuning
MAmmoTH-VL
ScholarCopilot
VISTA
OmniEdit
MEGA-Bench
VLM2Vec
TIGERScore
MAmmoTH
UniIR
ImagenHub
Science
StructLM
ConsistI2V
Mantis
MAmmoTH2
VideoScore
Long-Context

VLM2Vec

updated 16 days ago

The VLM2Vec embedding models.

Upvote
4

  • TIGER-Lab/VLM2Vec-LoRA

    Text Generation • Updated Jan 1 • 71 • 8

  • TIGER-Lab/VLM2Vec-Full

    Text Generation • Updated Apr 7 • 29.9k • 25

  • TIGER-Lab/MMEB-train

    Viewer • Updated Jan 28 • 2.14M • 2.94k • 14

  • TIGER-Lab/MMEB-eval

    Viewer • Updated Oct 28, 2024 • 37k • 6.96k • 10

  • TIGER-Lab/VLM2Vec-LLaVa-Next

    Image-Text-to-Text • Updated Dec 20, 2024 • 1.13k • 1

  • TIGER-Lab/VLM2Vec-Qwen2VL-7B

    Image-Text-to-Text • Updated 21 days ago • 399 • 3

    Note The current best version VLM2Vec model.


  • TIGER-Lab/VLM2Vec-Qwen2VL-2B

    Image-Text-to-Text • Updated Mar 13 • 960

  • Y-J-Ju/MMEB-eval

    Viewer • Updated 22 days ago • 37k • 209

  • Running
    30
    30

    MMEB Leaderboard

    📊

    The massive multimodal embedding benchmark

Upvote
4
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs