Collection: Dhivehi Voice AI Collection: Tools for Thaana speech recognition (ASR), text-to-speech (TTS), and audio processing (26 items)
This model is a fine-tuned version of openai/whisper-large-v3 on an unknown dataset. On the evaluation set, the last logged checkpoint (step 4000) reaches a validation loss of 0.0252, a WER of 2.0163, and an orthographic WER of 15.2648.
Training hyperparameters: more information needed. Training results, logged every 500 steps:
Training Loss | Epoch | Step | Validation Loss | WER | WER Ortho |
---|---|---|---|---|---|
0.0426 | 0.0354 | 500 | 0.0501 | 3.6048 | 24.5780 |
0.0293 | 0.0709 | 1000 | 0.0367 | 2.6889 | 21.3792 |
0.0251 | 0.1063 | 1500 | 0.0317 | 2.3869 | 17.6751 |
0.0244 | 0.1418 | 2000 | 0.0296 | 2.2782 | 16.7890 |
0.0209 | 0.1772 | 2500 | 0.0284 | 2.2486 | 16.2831 |
0.0205 | 0.2126 | 3000 | 0.0254 | 1.9749 | 14.9776 |
0.0234 | 0.2481 | 3500 | 0.0261 | 2.1892 | 15.1784 |
0.0229 | 0.2835 | 4000 | 0.0252 | 2.0163 | 15.2648 |
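
The card does not say how the two error-rate columns were computed. A common convention in Whisper fine-tuning scripts, assumed here, is that the orthographic WER is measured on the raw text (punctuation and casing included) while the plain WER is measured after text normalization, with both reported as percentages. The sketch below illustrates that distinction with a self-contained word-level edit distance. The `normalize` function is a hypothetical stand-in, not the normalizer used for this model, and the English sentences are made-up examples chosen only to keep the arithmetic easy to check.

```python
import unicodedata


def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(references, hypotheses):
    """Corpus-level WER in percent: total edits / total reference words."""
    edits = 0
    words = 0
    for ref, hyp in zip(references, hypotheses):
        ref_tokens, hyp_tokens = ref.split(), hyp.split()
        edits += edit_distance(ref_tokens, hyp_tokens)
        words += len(ref_tokens)
    return 100.0 * edits / words


def normalize(text):
    """Hypothetical normalizer: lowercase, drop punctuation, collapse spaces."""
    text = unicodedata.normalize("NFKC", text).lower()
    # Drop characters in the Unicode punctuation categories (P*).
    # Combining marks such as Thaana vowel signs are not punctuation and are kept.
    text = "".join(ch for ch in text
                   if not unicodedata.category(ch).startswith("P"))
    return " ".join(text.split())


# Made-up example data (not from the model's evaluation set).
refs = ["Hello, world.", "the cat sat"]
hyps = ["hello world", "the cat sat down"]

wer_ortho = wer(refs, hyps)
wer_norm = wer([normalize(r) for r in refs], [normalize(h) for h in hyps])
print(wer_ortho)  # 60.0
print(wer_norm)   # 20.0
```

In this toy case the raw comparison counts "Hello," versus "hello" and "world." versus "world" as substitutions, which is why the orthographic figure is much higher than the normalized one. The same effect would be consistent with the gap between the two columns in the table, though the card itself does not confirm the cause.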
Base model: openai/whisper-large-v3