Vision LLM - a SamoXXX Collection

Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

SamoXXX 's Collections

Vision LLM

updated 10 days ago

Collecting best Vision LLMs - to study and learn from them

rhymes-ai/Aria

Image-Text-to-Text • Updated 23 days ago • 16.6k • 628
microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 807 • 1.66k
jadechoghari/Ferret-UI-Gemma2b

Image-Text-to-Text • Updated Oct 18, 2024 • 313 • 50
jadechoghari/Ferret-UI-Llama8b

Image-Text-to-Text • Updated Jan 8 • 176 • 69
gpt-omni/mini-omni2

Any-to-Any • Updated Oct 24, 2024 • 186 • 271
mPLUG/DocOwl2

Image-Text-to-Text • Updated Sep 27, 2024 • 463 • 99
google/siglip-so400m-patch16-256-i18n

Zero-Shot Image Classification • Updated Nov 18, 2024 • 1.87k • 29
openvla/openvla-7b

Image-Text-to-Text • Updated Sep 16, 2024 • 622k • 113
NexaAIDev/OmniVLM-968M

Updated Dec 17, 2024 • 2.27k • 517
Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated Apr 6 • 3.33M • • 890
ByteDance-Seed/UI-TARS-7B-SFT

Image-Text-to-Text • Updated Jan 25 • 10.7k • 171
moonshotai/Kimi-VL-A3B-Instruct

Image-Text-to-Text • Updated 26 days ago • 88.8k • 191
reducto/RolmOCR

Image-Text-to-Text • Updated Apr 2 • 142k • 405

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs