meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 470k • • 1.47k
google/owlv2-base-patch16-ensemble Zero-Shot Object Detection • 0.2B • Updated Oct 31, 2024 • 597k • 107
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 3.51M • 2.47k