mlx-community/SmolVLM2-500M-Video-Instruct-mlx Video-Text-to-Text β’ Updated Feb 20, 2025 β’ 1.24k β’ 18
Running on A100 232 Omnilingual ASR Media Transcription π 232 Transcribe audio or video into text in any language
Running on Zero 2.58k Voice Clone π£ 2.58k Clone voices and generate speech from text using reference audio