microsoft/Phi-3.5-vision-instruct Image-Text-to-Text • 4B • Updated Sep 26, 2024 • 680k • 701
Running • 554 • Vision Arena (Testing VLMs side-by-side) • Analyze images to detect and label objects
internlm/internlm-xcomposer2d5-7b Visual Question Answering • Updated Jul 22, 2024 • 785k • 207
TinyLlama/TinyLlama-1.1B-Chat-v1.0 Text Generation • 1B • Updated Mar 17, 2024 • 1.11M • 1.36k