johannhartmann's Collections
Multimodal Models
Updated Aug 14
A collection of multimodal models for the GPU poor
Upvote
2
google/paligemma-3b-pt-896 • Image-Text-to-Text • Updated Jul 19 • 66.1k • 106
OpenGVLab/InternVL-Chat-V1-5 • Image-Text-to-Text • Updated 27 days ago • 14.8k • 399
alexshengzhili/llava-v1.5-13b-dpo • Text Generation • Updated Apr 13 • 5 • 5
llava-hf/llava-v1.6-mistral-7b-hf • Image-Text-to-Text • Updated about 12 hours ago • 1.07M • 221
Qwen/Qwen-VL • Text Generation • Updated Jan 25 • 34.4k • 202
THUDM/cogvlm2-llama3-chat-19B • Text Generation • Updated 16 days ago • 75.2k • 194
BK-Lee/MoAI-7B • Updated Mar 12 • 659 • 45
01-ai/Yi-VL-34B • Image-Text-to-Text • Updated Jun 26 • 199 • 257
mPLUG/DocOwl1.5-Omni • Updated Apr 10 • 263 • 16
google/paligemma-3b-ft-docvqa-896 • Image-Text-to-Text • Updated Jul 19 • 1.65k • 4
Lin-Chen/open-llava-next-llama3-8b • Image-Text-to-Text • Updated May 27 • 1.47k • 25
Mizukiluke/mplug_owl_2_1 • Updated Jan 31 • 29 • 11
HuanjinYao/DenseConnector-v1.5-8B • Image-to-Text • Updated May 26 • 4 • 7
microsoft/Phi-3-vision-128k-instruct • Text Generation • Updated about 1 month ago • 167k • 890
tiiuae/falcon-11B-vlm • Updated Jun 12 • 2.15k • 44
AIDC-AI/Ovis1.5-Llama3-8B • Image-Text-to-Text • Updated Aug 2 • 21.9k • 23
HuggingFaceM4/Idefics3-8B-Llama3 • Image-Text-to-Text • Updated 1 day ago • 39.4k • 213
openbmb/MiniCPM-V-2_6 • Image-Text-to-Text • Updated 29 days ago • 374k • 724
microsoft/Florence-2-large • Image-Text-to-Text • Updated 30 days ago • 492k • 1.1k