unsloth/Llama-3.2-90B-Vision-Instruct-bnb-4bit Image-Text-to-Text • 47B • Updated Nov 22, 2024 • 2.96k • 20
microsoft/Phi-3-vision-128k-instruct Text Generation • 4B • Updated Aug 20, 2024 • 23.6k • 960
meta-llama/Meta-Llama-3-70B-Instruct Text Generation • 71B • Updated 12 days ago • 95.1k • 1.48k
Running • 551 • Vision Arena (Testing VLMs side-by-side) • Analyze images to detect and label objects
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct Text Generation • 16B • Updated Jul 3, 2024 • 181k • 446