Hugging Face
merve's Collections
Vision Language Leaderboards
Updated Aug 24
This collection has all the vision language leaderboards.
Vidore Leaderboard • Running • 93
Open VLM Leaderboard • VLMEvalKit Evaluation Results Collection • Running on CPU Upgrade • 543
Vision Arena (Testing VLMs side-by-side) • Running • 535
SEED-Bench Leaderboard • Running • 81
MM-UPD Leaderboard • Running • 23
MMBench Leaderboard • Running • 18
topyun/SPARK • Viewer • Updated Aug 23 • 6.25k • 69 • 15