This model is a fine-tuned version of openai/whisper-base on the ta (Tamil) subset of the mozilla-foundation/common_voice_16_0 dataset. It achieves the following results on the evaluation set:
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | WER (%) |
|---|---|---|---|---|
| 0.1698 | 0.1 | 100 | 0.5723 | 30.4406 |
| 0.3578 | 0.2 | 200 | 0.4302 | 25.6862 |
| 0.2832 | 0.3 | 300 | 0.3967 | 23.2048 |
| 0.2663 | 0.4 | 400 | 0.4038 | 23.8525 |
| 0.5175 | 0.5 | 500 | 0.3962 | 24.1466 |
| 0.2365 | 0.6 | 600 | 0.3850 | 22.2595 |
| 0.1692 | 0.7 | 700 | 0.3960 | 21.8687 |
| 0.1815 | 0.8 | 800 | 0.3823 | 22.0772 |
| 0.1612 | 0.9 | 900 | 0.3701 | 21.8056 |
| 0.1393 | 1.0 | 1000 | 0.3750 | 21.4071 |
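
For readers unfamiliar with the word error rate column in the table above, here is a minimal sketch of how such a score is commonly computed: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the model output, divided by the number of reference words and expressed as a percentage. The function name `word_error_rate` and the whitespace tokenization are assumptions for illustration; this is not necessarily the exact metric implementation or text normalization used to produce the numbers in this card.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the word error rate between two transcripts, as a percentage.

    Illustrative helper (hypothetical name): splits on whitespace and
    applies no text normalization such as lowercasing or punctuation removal.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,                      # deletion
                    curr[j - 1] + 1,                  # insertion
                    prev[j - 1] + substitution_cost,  # match or substitution
                )
            )
        prev = curr

    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of four reference words.
    print(word_error_rate("a b c d", "a x c d"))
```

Evaluation scripts usually aggregate over a whole dataset by summing the edit counts and reference lengths across utterances before dividing, rather than averaging per-utterance scores, so a score computed this way for one sentence is not directly comparable to the table values.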
Base model: openai/whisper-base