SaoSamarth commited on
Commit
529e7f1
·
verified ·
1 Parent(s): e8509ab

End of training

Browse files
Files changed (3) hide show
  1. README.md +15 -17
  2. generation_config.json +1 -1
  3. model.safetensors +1 -1
README.md CHANGED
@@ -1,13 +1,13 @@
1
  ---
2
  library_name: transformers
3
  language:
4
- - hi
5
  license: apache-2.0
6
  base_model: openai/whisper-small
7
  tags:
8
  - generated_from_trainer
9
  datasets:
10
- - mozilla-foundation/common_voice_11_0
11
  metrics:
12
  - wer
13
  model-index:
@@ -17,15 +17,13 @@ model-index:
17
  name: Automatic Speech Recognition
18
  type: automatic-speech-recognition
19
  dataset:
20
- name: Common Voice 11.0
21
- type: mozilla-foundation/common_voice_11_0
22
- config: hi
23
- split: None
24
- args: 'config: hi, split: test'
25
  metrics:
26
  - name: Wer
27
  type: wer
28
- value: 46.165241682891725
29
  ---
30
 
31
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,10 +31,10 @@ should probably proofread and complete it, then remove this comment. -->
33
 
34
  # Whisper Small Hi - Sanchit Gandhi
35
 
36
- This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
37
  It achieves the following results on the evaluation set:
38
- - Loss: 0.4034
39
- - Wer: 46.1652
40
 
41
  ## Model description
42
 
@@ -67,15 +65,15 @@ The following hyperparameters were used during training:
67
 
68
  ### Training results
69
 
70
- | Training Loss | Epoch | Step | Validation Loss | Wer |
71
- |:-------------:|:------:|:----:|:---------------:|:-------:|
72
- | 0.3874 | 0.0612 | 200 | 0.4768 | 51.3163 |
73
- | 0.3332 | 0.1223 | 400 | 0.4034 | 46.1652 |
74
 
75
 
76
  ### Framework versions
77
 
78
  - Transformers 4.49.0
79
  - Pytorch 2.6.0+cu124
80
- - Datasets 3.4.0
81
- - Tokenizers 0.21.0
 
1
  ---
2
  library_name: transformers
3
  language:
4
+ - km
5
  license: apache-2.0
6
  base_model: openai/whisper-small
7
  tags:
8
  - generated_from_trainer
9
  datasets:
10
+ - Khmer_speech_dataset
11
  metrics:
12
  - wer
13
  model-index:
 
17
  name: Automatic Speech Recognition
18
  type: automatic-speech-recognition
19
  dataset:
20
+ name: Khmer_voice
21
+ type: Khmer_speech_dataset
22
+ args: 'config: km, split: test'
 
 
23
  metrics:
24
  - name: Wer
25
  type: wer
26
+ value: 79.34426229508198
27
  ---
28
 
29
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
31
 
32
  # Whisper Small Hi - Sanchit Gandhi
33
 
34
+ This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Khmer_voice dataset.
35
  It achieves the following results on the evaluation set:
36
+ - Loss: 0.6083
37
+ - Wer: 79.3443
38
 
39
  ## Model description
40
 
 
65
 
66
  ### Training results
67
 
68
+ | Training Loss | Epoch | Step | Validation Loss | Wer |
69
+ |:-------------:|:------:|:----:|:---------------:|:--------:|
70
+ | 0.3396 | 0.2899 | 200 | 0.6831 | 119.3443 |
71
+ | 0.1706 | 0.5797 | 400 | 0.6083 | 79.3443 |
72
 
73
 
74
  ### Framework versions
75
 
76
  - Transformers 4.49.0
77
  - Pytorch 2.6.0+cu124
78
+ - Datasets 3.4.1
79
+ - Tokenizers 0.21.1
generation_config.json CHANGED
@@ -150,7 +150,7 @@
150
  "<|yo|>": 50325,
151
  "<|zh|>": 50260
152
  },
153
- "language": "hindi",
154
  "max_initial_timestamp_index": 50,
155
  "max_length": 448,
156
  "no_timestamps_token_id": 50363,
 
150
  "<|yo|>": 50325,
151
  "<|zh|>": 50260
152
  },
153
+ "language": "km",
154
  "max_initial_timestamp_index": 50,
155
  "max_length": 448,
156
  "no_timestamps_token_id": 50363,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:03c4d75082c715320e373358b35ce02cdf1a79739927d041e1d44c317a7b6f6f
3
  size 966995080
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ff633e856deddf8c4a4608d6f1ca5909f6347bf72e6fc61b80acaa65768d32d8
3
  size 966995080