Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
Backyard AI
Jan
Jellybox
llama.cpp
LM Studio
LocalAI
Msty
node-llama-cpp
Ollama
RecurseChat
Sanctum
TGI
vLLM
Apps with no match
Draw Things
DiffusionBee
Invoke
JoyFusion
MLX LM
Inference Providers
Inference Providers with no match
Fireworks
Novita
Nebius AI
Together AI
Cerebras
Featherless AI
Nscale
fal
Hyperbolic
SambaNova
Groq
Replicate
Cohere
HF Inference API
Misc
Reset Misc
vision-language
Inference Endpoints
custom_code
text-generation-inference
Eval Results
4-bit precision
8-bit precision
Carbon Emissions
Misc with no match
Merge
text-embeddings-inference
Mixture of Experts
Apply filters
Models
253
Full-text search
Edit filters
Sort: Trending
Active filters:
vision-language
Clear all
ctranslate2-4you/GOT-OCR2_0-Customized
Image-Text-to-Text
•
0.7B
•
Updated
Feb 17
•
5
sbintuitions/sarashina2-vision-8b
Image-to-Text
•
8B
•
Updated
Mar 27
•
1.75k
•
5
sbintuitions/sarashina2-vision-14b
Image-to-Text
•
14B
•
Updated
Mar 27
•
781
•
8
UMA-IA/AQUILA-Engine-v1
Image-to-Text
•
8B
•
Updated
Mar 16
•
18
•
1
jpark677/internvl2-8b-mmbench-lora-ep-1-waa-false
Image-to-Text
•
8B
•
Updated
Apr 3
•
8
jpark677/internvl2-8b-mmbench-lora-ep-2-waa-false
Image-to-Text
•
8B
•
Updated
Apr 3
•
8
mradermacher/AQUILA-Engine-v1-GGUF
8B
•
Updated
Apr 12
•
110
mradermacher/AQUILA-Engine-v1-i1-GGUF
8B
•
Updated
Apr 12
•
270
TheEighthDay/SeekWorld_RL_PLUS
8B
•
Updated
Apr 19
•
1.81k
•
1
mradermacher/SeekWorld_RL_PLUS-GGUF
8B
•
Updated
Apr 16
•
630
nkkbr/ViCA-ARKitScenes
Video-Text-to-Text
•
8B
•
Updated
May 7
•
10
nkkbr/ViCA-ScanNet
Video-Text-to-Text
•
8B
•
Updated
May 7
•
8
nkkbr/ViCA-base
Video-Text-to-Text
•
8B
•
Updated
May 7
•
10
nkkbr/ViCA
Video-Text-to-Text
•
8B
•
Updated
about 1 month ago
•
32
nkkbr/ViCA-ScanNetPP
Video-Text-to-Text
•
8B
•
Updated
May 7
•
11
nkkbr/ViCA2-stage1-align
Video-Text-to-Text
•
8B
•
Updated
May 15
•
14
nkkbr/ViCA2-stage2-onevision-ft
Video-Text-to-Text
•
8B
•
Updated
May 15
•
152
nkkbr/ViCA2
Video-Text-to-Text
•
8B
•
Updated
about 1 month ago
•
38
nkkbr/ViCA2-init
Video-Text-to-Text
•
8B
•
Updated
May 15
•
43
ChongyuWang/ShowUI_Grounding_Qwen_2B_pretrained
Updated
Apr 26
•
8
kevin510/friday
Text Generation
•
4B
•
Updated
9 days ago
•
154
yemalin/furniture-captioner
0.2B
•
Updated
May 4
•
9
ragunath-ravi/blip-histopathology-finetuned
Image-to-Text
•
0.2B
•
Updated
May 4
•
13
•
1
nkkbr/ViCA2-thinkng
Video-Text-to-Text
•
8B
•
Updated
May 15
•
8
nkkbr/ViCA-thinking
Video-Text-to-Text
•
8B
•
Updated
May 7
•
11
aosm/Qwen2-VL-7B-PMC-VQA
Updated
May 10
Wauplin/vanilla-nanovlm
Image-Text-to-Text
•
0.2B
•
Updated
May 7
•
61
ariG23498/nanoVLM-demo
Image-Text-to-Text
•
Updated
May 7
•
13
srai86825/qwen-vl-tool-assistant-lora
Text Generation
•
Updated
May 9
KendrickX/openvla-7b-lora-cones
Updated
May 10
Previous
1
2
3
4
5
...
9
Next