Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
Backyard AI
DiffusionBee
Draw Things
Invoke
Jan
Jellybox
llama.cpp
LM Studio
LocalAI
MLX LM
Msty
node-llama-cpp
Ollama
RecurseChat
Sanctum
TGI
vLLM
Apps with no match
JoyFusion
Inference Providers
Select all
Fireworks
HF Inference API
Hyperbolic
Nebius AI
Inference Providers with no match
Novita
Together AI
Cerebras
Featherless AI
Nscale
fal
SambaNova
Groq
Replicate
Cohere
Misc
Reset Misc
multimodal
Inference Endpoints
text-generation-inference
custom_code
4-bit precision
Eval Results
Merge
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
1,240
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
ByteDance/Dolphin
Image-Text-to-Text
•
Updated
May 27
•
40.4k
•
409
Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
Updated
Apr 6
•
3.61M
•
•
996
Qwen/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
Updated
Apr 6
•
3.43M
•
431
Kwai-Keye/Keye-VL-8B-Preview
Image-Text-to-Text
•
Updated
about 2 hours ago
•
217
•
14
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text
•
Updated
21 days ago
•
938k
•
•
500
Hcompany/Holo1-7B
Image-Text-to-Text
•
Updated
17 days ago
•
12.8k
•
206
Qwen/Qwen2.5-Omni-7B
Any-to-Any
•
Updated
Apr 30
•
345k
•
1.68k
lingshu-medical-mllm/Lingshu-7B
Image-Text-to-Text
•
Updated
2 days ago
•
2.45k
•
29
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
Updated
Feb 6
•
889k
•
•
1.21k
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text
•
Updated
Apr 14
•
419k
•
•
395
ByteDance-Seed/UI-TARS-1.5-7B
Image-Text-to-Text
•
Updated
Apr 18
•
99.7k
•
305
Qwen/Qwen2.5-Omni-3B
Any-to-Any
•
Updated
Apr 30
•
108k
•
241
openvla/openvla-7b
Image-Text-to-Text
•
Updated
Sep 16, 2024
•
351k
•
128
Qwen/Qwen2.5-VL-72B-Instruct-AWQ
Image-Text-to-Text
•
Updated
Mar 7
•
22.8k
•
57
Qwen/Qwen2.5-VL-7B-Instruct-AWQ
Image-Text-to-Text
•
Updated
Apr 6
•
342k
•
77
stepfun-ai/Step1X-Edit
Image-to-Image
•
Updated
3 days ago
•
748
•
•
294
BAAI/Video-XL-2
Video-Text-to-Text
•
Updated
22 days ago
•
1.12k
•
42
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
Jan 12
•
1.11M
•
429
jinaai/jina-clip-v2
Feature Extraction
•
Updated
Apr 28
•
46.6k
•
•
255
unsloth/Qwen2.5-VL-7B-Instruct-bnb-4bit
Image-Text-to-Text
•
Updated
May 12
•
46.4k
•
11
remyxai/SpaceThinker-Qwen2.5VL-3B
Image-Text-to-Text
•
Updated
6 days ago
•
4.28k
•
20
remyxai/SpaceOm
Image-Text-to-Text
•
Updated
6 days ago
•
716
•
6
bartowski/Qwen_Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
Updated
May 8
•
3.54k
•
4
unsloth/Qwen2.5-Omni-7B-GGUF
Any-to-Any
•
Updated
30 days ago
•
17.1k
•
14
unsloth/Qwen2.5-Omni-3B-GGUF
Any-to-Any
•
Updated
30 days ago
•
7.96k
•
10
lingshu-medical-mllm/Lingshu-32B
Image-Text-to-Text
•
Updated
2 days ago
•
1.84k
•
38
csfufu/Revisual-R1-Coldstart
Image-Text-to-Text
•
Updated
2 days ago
•
1.08k
•
5
HuggingFaceM4/idefics-9b-instruct
Text Generation
•
Updated
Oct 12, 2023
•
3.57k
•
106
PsiPi/NousResearch_Nous-Hermes-2-Vision-GGUF
Image-Text-to-Text
•
Updated
Mar 11, 2024
•
618
•
16
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
•
Updated
Oct 14, 2024
•
556k
•
608
Previous
1
2
3
...
42
Next