facebook/wav2vec2-xls-r-300m fine-tuned on google/fleurs and mozilla-foundation/common_voice_13_0 for Igbo language.

WER: 0.51

Code for running:

from huggingsound import SpeechRecognitionModel

model = SpeechRecognitionModel("AstralZander/igbo_ASR")
audio_paths = [audio_path] # List with paths to audio
transcriptions = model.transcribe(audio_paths)

transcriptions # List of transcriptions, timestamps and probabilities
transcriptions[ind_audio]['transcription'] # Transcription of audio with the ind_audio index from the audio_paths list
Downloads last month
22
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Datasets used to train AstralZander/igbo_ASR