Whisper-Base Fine-tuned for Armenian ASR

This model is a fine-tuned version of OpenAI's Whisper-base on the Common Voice 20 Armenian dataset for automatic speech recognition.

Training Results

The model was trained for 5.34 epochs with the following final results:

Metric Value
Training Loss 0.122
Training Runtime 10,924 seconds (โ‰ˆ3.03 hours)
Training Samples/Second 7.32
Training Steps/Second 0.46
Total Training Steps 5,000
Epochs 5.34

Evaluation Results

Metric Value
Evaluation Loss 0.201
Word Error Rate (WER) 33.19%
Character Error Rate (CER) 6.98%
BLEU Score 47.71
Exact Match 16.50%
Average Prediction Length 7.69 tokens
Average Label Length 7.77 tokens
Length Ratio 0.989
Evaluation Runtime 1,590 seconds (โ‰ˆ26.5 minutes)
Evaluation Samples/Second 3.68
Evaluation Steps/Second 0.46

Model Details

  • Base Model: openai/whisper-base
  • Language: Armenian (hy)
  • Dataset: Chillarmo/common_voice_20_armenian
  • License: Apache 2.0

Notes

During model loading, there were missing keys in the checkpoint: ['proj_out.weight']. This is a common occurrence when fine-tuning Whisper models and typically doesn't affect performance significantly.

Downloads last month
36
Safetensors
Model size
72.6M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Chillarmo/whisper-base-armenian

Finetuned
(538)
this model

Dataset used to train Chillarmo/whisper-base-armenian

Evaluation results