NexaAI/whisper-large-v3-turbo

Quickstart

Run them directly with nexa-sdk installed In nexa-sdk CLI:

NexaAI/whisper-large-v3-turbo

Overview

Whisper is a state-of-the-art model for automatic speech recognition (ASR) and speech translation, proposed in the paper Robust Speech Recognition via Large-Scale Weak Supervision by Alec Radford et al. from OpenAI. Trained on >5M hours of labeled data, Whisper demonstrates a strong ability to generalise to many datasets and domains in a zero-shot setting.

Whisper large-v3-turbo is a finetuned version of a pruned Whisper large-v3. In other words, it's the exact same model, except that the number of decoding layers have reduced from 32 to 4. As a result, the model is way faster, at the expense of a minor quality degradation. You can find more details about it in this GitHub discussion.

Reference

Original model card: openai/whisper-large-v3-turbo

NexaAI
/

whisper-large-v3-turbo-MLX

NexaAI/whisper-large-v3-turbo

Quickstart

Overview

Reference

Model tree for NexaAI/whisper-large-v3-turbo-MLX

Collection including NexaAI/whisper-large-v3-turbo-MLX

Multimodal - MLX