Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Oliver2021 's Collections
VLA
Video-gen
Image-gen
Agent
MLLM
Long context
LLM understanding
RAG
MM-EVAL
reasoning
MMLM

MM-EVAL

updated 18 days ago
Upvote
-

  • MMRA: A Benchmark for Multi-granularity Multi-image Relational Association

    Paper • 2407.17379 • Published Jul 24, 2024 • 3

  • MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines

    Paper • 2409.12959 • Published Sep 19, 2024 • 38

  • MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks

    Paper • 2505.16459 • Published May 22 • 45

  • VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

    Paper • 2505.23359 • Published about 1 month ago • 39

  • MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

    Paper • 2506.05523 • Published 23 days ago • 33
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs