Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Fireworks
Cerebras
Nebius AI
Novita
Together AI
fal
Nscale
Groq
+ 9
Apply filters
Models
6,932
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
cjpais/llava-1.6-mistral-7b-gguf
Image-Text-to-Text
•
7B
•
Updated
Mar 6, 2024
•
4.64k
•
108
Intel/llava-gemma-2b
Image-Text-to-Text
•
3B
•
Updated
Jun 11, 2024
•
1.06k
•
45
google/paligemma-3b-mix-448
Image-Text-to-Text
•
3B
•
Updated
Jul 19, 2024
•
4.74k
•
115
google/paligemma-3b-pt-896
Image-Text-to-Text
•
3B
•
Updated
Jun 22
•
1.26k
•
121
HuggingFaceM4/Idefics3-8B-Llama3
Image-Text-to-Text
•
8B
•
Updated
Dec 2, 2024
•
40.2k
•
295
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
Jan 12
•
3.64M
•
453
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
0.7B
•
Updated
Feb 4
•
36.8k
•
1.52k
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
8B
•
Updated
about 11 hours ago
•
24.7k
•
549
huihui-ai/Qwen2-VL-7B-Instruct-abliterated
Image-Text-to-Text
•
8B
•
Updated
Nov 19, 2024
•
82
•
19
xing0047/cca-llava-1.5-7b
Image-Text-to-Text
•
Updated
Oct 28, 2024
•
228
•
3
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text
•
2B
•
Updated
Apr 8
•
29.4k
•
550
unsloth/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
Mar 9
•
14.8k
•
5
mlx-community/Molmo-7B-D-0924-bf16
Image-Text-to-Text
•
8B
•
Updated
Jan 1
•
585
•
1
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text
•
3B
•
Updated
Dec 18, 2024
•
57.8k
•
217
deepseek-ai/deepseek-vl2
Image-Text-to-Text
•
27B
•
Updated
Dec 18, 2024
•
3.19k
•
361
prithivMLmods/Qwen2-VL-OCR-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
May 2
•
3.51k
•
101
ByteDance/Sa2VA-4B
Image-Text-to-Text
•
4B
•
Updated
Sep 8
•
152k
•
•
82
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text
•
73B
•
Updated
Jun 6
•
773k
•
•
549
mlx-community/Qwen2.5-VL-7B-Instruct-8bit
Image-Text-to-Text
•
2B
•
Updated
Feb 25
•
1.44k
•
18
krutrim-ai-labs/Chitrarth
Image-Text-to-Text
•
8B
•
Updated
Mar 26
•
105
•
16
HuggingFaceTB/SmolVLM2-2.2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
Apr 8
•
106k
•
273
AIDC-AI/Ovis2-1B
Image-Text-to-Text
•
1B
•
Updated
Aug 15
•
19.3k
•
95
HuggingFaceTB/SmolVLM2-256M-Video-Instruct
Image-Text-to-Text
•
0.3B
•
Updated
Apr 8
•
664k
•
80
Qwen/Qwen2.5-VL-7B-Instruct-AWQ
Image-Text-to-Text
•
3B
•
Updated
Apr 6
•
448k
•
90
CohereLabs/aya-vision-32b
Image-Text-to-Text
•
33B
•
Updated
28 days ago
•
251
•
•
217
mradermacher/ToriiGate-v0.4-7B-GGUF
Image-Text-to-Text
•
8B
•
Updated
Jul 31
•
657
•
2
mradermacher/ToriiGate-v0.4-7B-i1-GGUF
Image-Text-to-Text
•
8B
•
Updated
Jul 11
•
274
•
2
google/shieldgemma-2-4b-it
Image-Text-to-Text
•
4B
•
Updated
Apr 4
•
1.76k
•
130
LANSG/GEM
Image-Text-to-Text
•
7B
•
Updated
Mar 12
•
107
•
5
ayeshaishaq/DriveLMMo1
Image-Text-to-Text
•
8B
•
Updated
Mar 17
•
62
•
6
Previous
1
2
3
4
5
6
...
100
Next