meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 120k • 1.59k
Voxtral TTS Demo • Running • Agents • Featured • 204 • Generate realistic speech from text with custom or preset voices
ibm-granite/granite-4.0-1b-speech Automatic Speech Recognition • 2B • Updated 25 days ago • 95.1k • 234
distil-whisper/distil-large-v3.5 Automatic Speech Recognition • 0.8B • Updated 14 days ago • 58.6k • 89
distil-whisper/distil-large-v3 Automatic Speech Recognition • 0.8B • Updated 7 days ago • 1.43M • 376
Frame2KG Collection • A Benchmark and Evaluation Toolkit for Interpretable Frame-to-Graph Generation • 6 items • Updated Feb 19 • 1