google/owlv2-base-patch16-ensemble Zero-Shot Object Detection β’ 0.2B β’ Updated Oct 31, 2024 β’ 685k β’ 107
meta-llama/Llama-Guard-3-11B-Vision Image-Text-to-Text β’ 11B β’ Updated Nov 18, 2024 β’ 1.96k β’ 62
microsoft/Phi-3.5-vision-instruct Image-Text-to-Text β’ 4B β’ Updated Sep 26, 2024 β’ 927k β’ 698
nvidia/Mistral-NeMo-Minitron-8B-Base Text Generation β’ 8B β’ Updated Aug 22, 2024 β’ 6.34k β’ 176
Running on Zero 779 779 Florence 2 π Analyze images to generate captions, detect objects, or perform OCR