Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

1,443

Full-text search

Active filters: multimodal

rednote-hilab/dots.vlm1.inst

Image-Text-to-Text • 672B • Updated 5 days ago • 4.58k • 63

mispeech/midashenglm-7b

Audio-Text-to-Text • 8B • Updated 5 days ago • 1.24k • 52

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 4.97M • • 1.13k

Skywork/Skywork-UniPic-1.5B

Any-to-Any • Updated 18 minutes ago • 1.85k • 109

unsloth/Qwen2.5-VL-7B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated May 12 • 29.9k • 28

ByteDance/Dolphin

Image-Text-to-Text • 0.4B • Updated 27 days ago • 35.8k • 452

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 3.78M • 481

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 132k • 1.74k

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 104k • 342

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14 • 491k • • 421

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 728k • • 523

Qwen/Qwen2.5-Omni-3B

Any-to-Any • 6B • Updated Apr 30 • 225k • 269

huihui-ai/Qwen2.5-VL-7B-Instruct-abliterated

Image-Text-to-Text • 8B • Updated Apr 1 • 1.74k • 23

Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12 • 1.11M • 437

ggml-org/Qwen2.5-Omni-7B-GGUF

Any-to-Any • 8B • Updated May 26 • 2k • 14

Kwai-Keye/Keye-VL-8B-Preview

Video-Text-to-Text • 9B • Updated Jul 7 • 44.3k • 79

TencentARC/ARC-Hunyuan-Video-7B

Video-Text-to-Text • 9B • Updated 6 days ago • 680 • 19

Skywork/UniPic2-Metaquery-9B

Any-to-Any • Updated 10 minutes ago • 3

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6 • 539k • • 1.22k

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 84.2k • 106

Qwen/Qwen2-VL-2B

Image-Text-to-Text • 2B • Updated Dec 6, 2024 • 153k • 51

jinaai/jina-clip-v2

Feature Extraction • 0.9B • Updated Apr 28 • 48k • 270

unsloth/Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit

Image-Text-to-Text • 5B • Updated May 12 • 43.5k • 37

Qwen/Qwen2.5-VL-7B-Instruct-AWQ

Image-Text-to-Text • 3B • Updated Apr 6 • 340k • 83

unsloth/Qwen2.5-VL-32B-Instruct-unsloth-bnb-4bit

Image-Text-to-Text • 20B • Updated May 12 • 6.12k • 14

remyxai/SpaceThinker-Qwen2.5VL-3B

Image-Text-to-Text • 4B • Updated 11 days ago • 8.3k • 27

openbmb/AgentCPM-GUI

Image-Text-to-Text • 8B • Updated Jun 14 • 348 • 126

bartowski/Qwen_Qwen2.5-VL-7B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated May 8 • 3.06k • 6

Mungert/UI-TARS-1.5-7B-GGUF

Image-Text-to-Text • 8B • Updated 2 days ago • 5.02k • 8

Zery/CUA_World_State_Model

Image-Text-to-Text • Updated 5 days ago • 6 • 2