cantillation commited on
Commit
2bfa915
·
verified ·
1 Parent(s): 53943fa

Model save

Browse files
README.md ADDED
@@ -0,0 +1,142 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: openai/whisper-tiny
4
+ tags:
5
+ - generated_from_trainer
6
+ metrics:
7
+ - wer
8
+ model-index:
9
+ - name: Teamim-tiny_WeightDecay-0.05_Augmented_Combined-Data_date-10-07-2024_14-33
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # Teamim-tiny_WeightDecay-0.05_Augmented_Combined-Data_date-10-07-2024_14-33
17
+
18
+ This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on an unknown dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 0.2681
21
+ - Wer: 16.4952
22
+ - Avg Precision Exact: 0.8439
23
+ - Avg Recall Exact: 0.8446
24
+ - Avg F1 Exact: 0.8437
25
+ - Avg Precision Letter Shift: 0.8665
26
+ - Avg Recall Letter Shift: 0.8674
27
+ - Avg F1 Letter Shift: 0.8664
28
+ - Avg Precision Word Level: 0.8711
29
+ - Avg Recall Word Level: 0.8725
30
+ - Avg F1 Word Level: 0.8713
31
+ - Avg Precision Word Shift: 0.9505
32
+ - Avg Recall Word Shift: 0.9538
33
+ - Avg F1 Word Shift: 0.9515
34
+ - Precision Median Exact: 0.9231
35
+ - Recall Median Exact: 0.9231
36
+ - F1 Median Exact: 0.9286
37
+ - Precision Max Exact: 1.0
38
+ - Recall Max Exact: 1.0
39
+ - F1 Max Exact: 1.0
40
+ - Precision Min Exact: 0.0
41
+ - Recall Min Exact: 0.0
42
+ - F1 Min Exact: 0.0
43
+ - Precision Min Letter Shift: 0.0
44
+ - Recall Min Letter Shift: 0.0
45
+ - F1 Min Letter Shift: 0.0
46
+ - Precision Min Word Level: 0.0
47
+ - Recall Min Word Level: 0.0
48
+ - F1 Min Word Level: 0.0
49
+ - Precision Min Word Shift: 0.1429
50
+ - Recall Min Word Shift: 0.125
51
+ - F1 Min Word Shift: 0.1333
52
+
53
+ ## Model description
54
+
55
+ More information needed
56
+
57
+ ## Intended uses & limitations
58
+
59
+ More information needed
60
+
61
+ ## Training and evaluation data
62
+
63
+ More information needed
64
+
65
+ ## Training procedure
66
+
67
+ ### Training hyperparameters
68
+
69
+ The following hyperparameters were used during training:
70
+ - learning_rate: 1e-05
71
+ - train_batch_size: 8
72
+ - eval_batch_size: 32
73
+ - seed: 42
74
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
75
+ - lr_scheduler_type: linear
76
+ - lr_scheduler_warmup_steps: 500
77
+ - training_steps: 500000
78
+ - mixed_precision_training: Native AMP
79
+
80
+ ### Training results
81
+
82
+ | Training Loss | Epoch | Step | Validation Loss | Wer | Avg Precision Exact | Avg Recall Exact | Avg F1 Exact | Avg Precision Letter Shift | Avg Recall Letter Shift | Avg F1 Letter Shift | Avg Precision Word Level | Avg Recall Word Level | Avg F1 Word Level | Avg Precision Word Shift | Avg Recall Word Shift | Avg F1 Word Shift | Precision Median Exact | Recall Median Exact | F1 Median Exact | Precision Max Exact | Recall Max Exact | F1 Max Exact | Precision Min Exact | Recall Min Exact | F1 Min Exact | Precision Min Letter Shift | Recall Min Letter Shift | F1 Min Letter Shift | Precision Min Word Level | Recall Min Word Level | F1 Min Word Level | Precision Min Word Shift | Recall Min Word Shift | F1 Min Word Shift |
83
+ |:-------------:|:-------:|:------:|:---------------:|:--------:|:-------------------:|:----------------:|:------------:|:--------------------------:|:-----------------------:|:-------------------:|:------------------------:|:---------------------:|:-----------------:|:------------------------:|:---------------------:|:-----------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:----------------:|:------------:|:-------------------:|:----------------:|:------------:|:--------------------------:|:-----------------------:|:-------------------:|:------------------------:|:---------------------:|:-----------------:|:------------------------:|:---------------------:|:-----------------:|
84
+ | No log | 0.0001 | 1 | 7.6124 | 162.6042 | 0.0008 | 0.0010 | 0.0006 | 0.0056 | 0.0037 | 0.0039 | 0.0052 | 0.0137 | 0.0054 | 0.0457 | 0.0410 | 0.0382 | 0.0 | 0.0 | 0.0 | 0.1429 | 0.5 | 0.1667 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
85
+ | 0.1423 | 0.5167 | 10000 | 0.2069 | 31.4547 | 0.6926 | 0.6967 | 0.6938 | 0.7262 | 0.7306 | 0.7275 | 0.7344 | 0.7389 | 0.7357 | 0.8749 | 0.8835 | 0.8779 | 0.7857 | 0.8 | 0.7879 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
86
+ | 0.0563 | 1.0334 | 20000 | 0.1762 | 24.7742 | 0.7620 | 0.7678 | 0.7642 | 0.7918 | 0.7980 | 0.7942 | 0.7980 | 0.8041 | 0.8003 | 0.9138 | 0.9218 | 0.9169 | 0.8667 | 0.875 | 0.8696 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
87
+ | 0.0349 | 1.5501 | 30000 | 0.1775 | 23.2071 | 0.7764 | 0.7797 | 0.7774 | 0.8058 | 0.8095 | 0.8069 | 0.8123 | 0.8163 | 0.8136 | 0.9192 | 0.9257 | 0.9215 | 0.875 | 0.8824 | 0.8800 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0769 | 0.1 | 0.0870 |
88
+ | 0.0197 | 2.0668 | 40000 | 0.1815 | 21.6369 | 0.7865 | 0.7919 | 0.7886 | 0.8125 | 0.8185 | 0.8148 | 0.8185 | 0.8244 | 0.8208 | 0.9226 | 0.9301 | 0.9255 | 0.9 | 0.9 | 0.8966 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
89
+ | 0.0127 | 2.5834 | 50000 | 0.1851 | 21.2876 | 0.8007 | 0.7992 | 0.7993 | 0.8265 | 0.8250 | 0.8251 | 0.8320 | 0.8310 | 0.8309 | 0.9294 | 0.9301 | 0.9290 | 0.9091 | 0.9 | 0.9 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0909 | 0.0833 | 0.0870 |
90
+ | 0.028 | 3.1001 | 60000 | 0.1917 | 21.2719 | 0.7984 | 0.7993 | 0.7983 | 0.8243 | 0.8255 | 0.8243 | 0.8295 | 0.8315 | 0.8299 | 0.9263 | 0.9306 | 0.9276 | 0.9 | 0.9 | 0.9 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1000 |
91
+ | 0.0186 | 3.6168 | 70000 | 0.1984 | 20.3688 | 0.8036 | 0.8073 | 0.8049 | 0.8294 | 0.8333 | 0.8307 | 0.8349 | 0.8388 | 0.8362 | 0.9316 | 0.9382 | 0.9341 | 0.9091 | 0.9091 | 0.9091 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1429 | 0.125 | 0.1333 |
92
+ | 0.0056 | 4.1335 | 80000 | 0.2073 | 20.3751 | 0.8012 | 0.8009 | 0.8005 | 0.8268 | 0.8266 | 0.8261 | 0.8325 | 0.8324 | 0.8319 | 0.9334 | 0.9358 | 0.9338 | 0.9091 | 0.9091 | 0.9091 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0909 | 0.1 | 0.1053 |
93
+ | 0.0071 | 4.6502 | 90000 | 0.2081 | 19.6419 | 0.8040 | 0.8091 | 0.8060 | 0.8293 | 0.8347 | 0.8314 | 0.8351 | 0.8405 | 0.8372 | 0.9324 | 0.9396 | 0.9352 | 0.9091 | 0.9091 | 0.9091 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
94
+ | 0.0036 | 5.1669 | 100000 | 0.2132 | 19.3650 | 0.8079 | 0.8117 | 0.8092 | 0.8322 | 0.8364 | 0.8337 | 0.8373 | 0.8418 | 0.8389 | 0.9366 | 0.9426 | 0.9389 | 0.9091 | 0.9167 | 0.9091 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1429 | 0.1111 | 0.125 |
95
+ | 0.0063 | 5.6836 | 110000 | 0.2126 | 18.7954 | 0.8193 | 0.8210 | 0.8196 | 0.8443 | 0.8463 | 0.8447 | 0.8498 | 0.8517 | 0.8502 | 0.9414 | 0.9453 | 0.9427 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1000 |
96
+ | 0.0019 | 6.2003 | 120000 | 0.2191 | 18.6633 | 0.8195 | 0.8200 | 0.8192 | 0.8435 | 0.8441 | 0.8432 | 0.8490 | 0.8496 | 0.8488 | 0.9403 | 0.9429 | 0.9409 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1429 | 0.125 | 0.1333 |
97
+ | 0.0034 | 6.7170 | 130000 | 0.2204 | 18.7860 | 0.8138 | 0.8145 | 0.8136 | 0.8375 | 0.8384 | 0.8374 | 0.8426 | 0.8436 | 0.8425 | 0.9370 | 0.9403 | 0.9380 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0909 | 0.0909 | 0.0909 |
98
+ | 0.005 | 7.2336 | 140000 | 0.2224 | 18.5877 | 0.8185 | 0.8247 | 0.8211 | 0.8419 | 0.8483 | 0.8445 | 0.8470 | 0.8532 | 0.8495 | 0.9384 | 0.9462 | 0.9416 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.0909 | 0.0952 |
99
+ | 0.0028 | 7.7503 | 150000 | 0.2273 | 19.2863 | 0.8180 | 0.8169 | 0.8169 | 0.8424 | 0.8415 | 0.8414 | 0.8480 | 0.8472 | 0.8471 | 0.9376 | 0.9389 | 0.9376 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
100
+ | 0.0013 | 8.2670 | 160000 | 0.2303 | 18.4776 | 0.8220 | 0.8252 | 0.8230 | 0.8456 | 0.8491 | 0.8468 | 0.8512 | 0.8552 | 0.8526 | 0.9418 | 0.9474 | 0.9439 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1000 |
101
+ | 0.004 | 8.7837 | 170000 | 0.2310 | 18.8238 | 0.8133 | 0.8142 | 0.8132 | 0.8378 | 0.8389 | 0.8378 | 0.8435 | 0.8446 | 0.8435 | 0.9389 | 0.9423 | 0.9399 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
102
+ | 0.0017 | 9.3004 | 180000 | 0.2357 | 18.0371 | 0.8232 | 0.8248 | 0.8234 | 0.8478 | 0.8497 | 0.8482 | 0.8528 | 0.8551 | 0.8534 | 0.9464 | 0.9507 | 0.9479 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1000 |
103
+ | 0.0039 | 9.8171 | 190000 | 0.2381 | 18.0717 | 0.8201 | 0.8216 | 0.8204 | 0.8438 | 0.8454 | 0.8441 | 0.8493 | 0.8510 | 0.8496 | 0.9435 | 0.9470 | 0.9445 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
104
+ | 0.0034 | 10.3338 | 200000 | 0.2393 | 18.2668 | 0.8230 | 0.8253 | 0.8236 | 0.8469 | 0.8494 | 0.8476 | 0.8518 | 0.8546 | 0.8527 | 0.9406 | 0.9453 | 0.9423 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
105
+ | 0.0012 | 10.8505 | 210000 | 0.2407 | 18.2857 | 0.8232 | 0.8268 | 0.8245 | 0.8485 | 0.8523 | 0.8498 | 0.8536 | 0.8574 | 0.8549 | 0.9392 | 0.9454 | 0.9415 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0714 | 0.0833 | 0.0870 |
106
+ | 0.0003 | 11.3672 | 220000 | 0.2431 | 18.1409 | 0.8230 | 0.8240 | 0.8230 | 0.8464 | 0.8478 | 0.8465 | 0.8511 | 0.8529 | 0.8515 | 0.9408 | 0.9448 | 0.9421 | 0.9167 | 0.9167 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1333 | 0.1 | 0.1176 |
107
+ | 0.0019 | 11.8838 | 230000 | 0.2458 | 18.2227 | 0.8211 | 0.8241 | 0.8221 | 0.8452 | 0.8483 | 0.8462 | 0.8506 | 0.8538 | 0.8517 | 0.9427 | 0.9482 | 0.9447 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
108
+ | 0.0008 | 12.4005 | 240000 | 0.2445 | 18.0591 | 0.8262 | 0.8278 | 0.8265 | 0.8498 | 0.8514 | 0.8500 | 0.8549 | 0.8566 | 0.8552 | 0.9435 | 0.9470 | 0.9446 | 0.9167 | 0.9167 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
109
+ | 0.0004 | 12.9172 | 250000 | 0.2486 | 17.9647 | 0.8285 | 0.8306 | 0.8290 | 0.8518 | 0.8541 | 0.8524 | 0.8565 | 0.8588 | 0.8571 | 0.9432 | 0.9472 | 0.9445 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
110
+ | 0.0011 | 13.4339 | 260000 | 0.2550 | 17.8577 | 0.8237 | 0.8232 | 0.8229 | 0.8471 | 0.8467 | 0.8463 | 0.8518 | 0.8519 | 0.8513 | 0.9452 | 0.9481 | 0.9460 | 0.9167 | 0.9167 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1000 |
111
+ | 0.0004 | 13.9506 | 270000 | 0.2535 | 17.9710 | 0.8245 | 0.8263 | 0.8248 | 0.8485 | 0.8506 | 0.8489 | 0.8537 | 0.8560 | 0.8543 | 0.9438 | 0.9485 | 0.9455 | 0.9167 | 0.9167 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1429 | 0.1111 | 0.125 |
112
+ | 0.0008 | 14.4673 | 280000 | 0.2510 | 17.7570 | 0.8331 | 0.8334 | 0.8327 | 0.8569 | 0.8572 | 0.8565 | 0.8626 | 0.8632 | 0.8624 | 0.9458 | 0.9480 | 0.9462 | 0.9167 | 0.9167 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
113
+ | 0.0004 | 14.9840 | 290000 | 0.2518 | 17.9081 | 0.8274 | 0.8268 | 0.8265 | 0.8509 | 0.8504 | 0.8501 | 0.8557 | 0.8554 | 0.8550 | 0.9443 | 0.9459 | 0.9444 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
114
+ | 0.0001 | 15.5007 | 300000 | 0.2551 | 17.7885 | 0.8296 | 0.8303 | 0.8294 | 0.8531 | 0.8538 | 0.8529 | 0.8583 | 0.8592 | 0.8582 | 0.9433 | 0.9464 | 0.9441 | 0.9167 | 0.9167 | 0.9167 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1429 | 0.1111 | 0.125 |
115
+ | 0.0005 | 16.0174 | 310000 | 0.2559 | 17.4581 | 0.8316 | 0.8329 | 0.8317 | 0.8544 | 0.8558 | 0.8545 | 0.8600 | 0.8614 | 0.8601 | 0.9438 | 0.9479 | 0.9451 | 0.9231 | 0.9231 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
116
+ | 0.0001 | 16.5340 | 320000 | 0.2566 | 17.1528 | 0.8355 | 0.8361 | 0.8353 | 0.8571 | 0.8579 | 0.8570 | 0.8617 | 0.8632 | 0.8619 | 0.9475 | 0.9510 | 0.9486 | 0.9231 | 0.9231 | 0.9286 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
117
+ | 0.0002 | 17.0507 | 330000 | 0.2613 | 17.3605 | 0.8336 | 0.8353 | 0.8339 | 0.8563 | 0.8583 | 0.8567 | 0.8612 | 0.8634 | 0.8618 | 0.9483 | 0.9517 | 0.9494 | 0.9231 | 0.9231 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1429 | 0.1111 | 0.125 |
118
+ | 0.0001 | 17.5674 | 340000 | 0.2607 | 17.1560 | 0.8381 | 0.8399 | 0.8385 | 0.8602 | 0.8623 | 0.8607 | 0.8652 | 0.8675 | 0.8658 | 0.9477 | 0.9518 | 0.9491 | 0.9231 | 0.9231 | 0.9286 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1000 |
119
+ | 0.0 | 18.0841 | 350000 | 0.2630 | 17.0962 | 0.8374 | 0.8390 | 0.8377 | 0.8608 | 0.8626 | 0.8612 | 0.8660 | 0.8681 | 0.8665 | 0.9472 | 0.9515 | 0.9487 | 0.9231 | 0.9231 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
120
+ | 0.0008 | 18.6008 | 360000 | 0.2622 | 17.3763 | 0.8335 | 0.8353 | 0.8339 | 0.8555 | 0.8577 | 0.8561 | 0.8603 | 0.8632 | 0.8612 | 0.9443 | 0.9492 | 0.9461 | 0.9231 | 0.9231 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
121
+ | 0.0 | 19.1175 | 370000 | 0.2637 | 17.1245 | 0.8343 | 0.8365 | 0.8349 | 0.8574 | 0.8598 | 0.8581 | 0.8623 | 0.8652 | 0.8632 | 0.9469 | 0.9518 | 0.9487 | 0.9231 | 0.9231 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
122
+ | 0.0 | 19.6342 | 380000 | 0.2651 | 17.2535 | 0.8347 | 0.8359 | 0.8348 | 0.8572 | 0.8585 | 0.8573 | 0.8621 | 0.8638 | 0.8624 | 0.9461 | 0.9504 | 0.9476 | 0.9231 | 0.9231 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0833 | 0.1 | 0.0909 |
123
+ | 0.0 | 20.1509 | 390000 | 0.2621 | 16.8130 | 0.8345 | 0.8358 | 0.8347 | 0.8569 | 0.8584 | 0.8572 | 0.8616 | 0.8632 | 0.8619 | 0.9495 | 0.9528 | 0.9506 | 0.9231 | 0.9231 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
124
+ | 0.0001 | 20.6676 | 400000 | 0.2675 | 16.9420 | 0.8399 | 0.8397 | 0.8393 | 0.8622 | 0.8621 | 0.8616 | 0.8667 | 0.8673 | 0.8665 | 0.9490 | 0.9514 | 0.9495 | 0.9231 | 0.9231 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1000 |
125
+ | 0.0007 | 21.1843 | 410000 | 0.2629 | 16.6557 | 0.8436 | 0.8446 | 0.8435 | 0.8659 | 0.8671 | 0.8659 | 0.8705 | 0.8722 | 0.8708 | 0.9495 | 0.9534 | 0.9508 | 0.9231 | 0.9231 | 0.9286 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 | 0.1 | 0.1000 |
126
+ | 0.0 | 21.7009 | 420000 | 0.2677 | 16.7910 | 0.8367 | 0.8380 | 0.8368 | 0.8588 | 0.8604 | 0.8591 | 0.8635 | 0.8656 | 0.8641 | 0.9480 | 0.9522 | 0.9495 | 0.9231 | 0.9231 | 0.9231 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
127
+ | 0.0 | 22.2176 | 430000 | 0.2676 | 16.7626 | 0.8367 | 0.8387 | 0.8372 | 0.8595 | 0.8618 | 0.8601 | 0.8641 | 0.8667 | 0.8648 | 0.9474 | 0.9526 | 0.9493 | 0.9231 | 0.9231 | 0.9286 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
128
+ | 0.0 | 22.7343 | 440000 | 0.2664 | 16.8004 | 0.8392 | 0.8411 | 0.8397 | 0.8613 | 0.8634 | 0.8618 | 0.8659 | 0.8686 | 0.8667 | 0.9475 | 0.9518 | 0.9490 | 0.9231 | 0.9231 | 0.9286 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
129
+ | 0.0 | 23.2510 | 450000 | 0.2645 | 16.5455 | 0.8441 | 0.8445 | 0.8438 | 0.8665 | 0.8670 | 0.8662 | 0.8711 | 0.8721 | 0.8711 | 0.9504 | 0.9534 | 0.9513 | 0.9231 | 0.9231 | 0.9286 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1429 | 0.125 | 0.1333 |
130
+ | 0.0 | 23.7677 | 460000 | 0.2675 | 16.5927 | 0.8414 | 0.8427 | 0.8416 | 0.8638 | 0.8652 | 0.8640 | 0.8682 | 0.8700 | 0.8686 | 0.9490 | 0.9526 | 0.9502 | 0.9231 | 0.9231 | 0.9286 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1429 | 0.125 | 0.1333 |
131
+ | 0.0 | 24.2844 | 470000 | 0.2676 | 16.4857 | 0.8427 | 0.8438 | 0.8428 | 0.8648 | 0.8661 | 0.8650 | 0.8695 | 0.8711 | 0.8698 | 0.9497 | 0.9531 | 0.9508 | 0.9231 | 0.9231 | 0.9286 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
132
+ | 0.0 | 24.8011 | 480000 | 0.2673 | 16.5959 | 0.8431 | 0.8436 | 0.8428 | 0.8651 | 0.8658 | 0.8649 | 0.8697 | 0.8709 | 0.8698 | 0.9491 | 0.9523 | 0.9501 | 0.9231 | 0.9231 | 0.9286 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.125 | 0.1176 | 0.1212 |
133
+ | 0.0 | 25.3178 | 490000 | 0.2677 | 16.5707 | 0.8434 | 0.8440 | 0.8432 | 0.8656 | 0.8664 | 0.8655 | 0.8701 | 0.8714 | 0.8703 | 0.9500 | 0.9532 | 0.9510 | 0.9231 | 0.9231 | 0.9286 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1429 | 0.125 | 0.1333 |
134
+ | 0.0 | 25.8345 | 500000 | 0.2681 | 16.4952 | 0.8439 | 0.8446 | 0.8437 | 0.8665 | 0.8674 | 0.8664 | 0.8711 | 0.8725 | 0.8713 | 0.9505 | 0.9538 | 0.9515 | 0.9231 | 0.9231 | 0.9286 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1429 | 0.125 | 0.1333 |
135
+
136
+
137
+ ### Framework versions
138
+
139
+ - Transformers 4.41.2
140
+ - Pytorch 2.2.1
141
+ - Datasets 2.20.0
142
+ - Tokenizers 0.19.1
generation_config.json ADDED
@@ -0,0 +1,248 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "alignment_heads": [
3
+ [
4
+ 2,
5
+ 2
6
+ ],
7
+ [
8
+ 3,
9
+ 0
10
+ ],
11
+ [
12
+ 3,
13
+ 2
14
+ ],
15
+ [
16
+ 3,
17
+ 3
18
+ ],
19
+ [
20
+ 3,
21
+ 4
22
+ ],
23
+ [
24
+ 3,
25
+ 5
26
+ ]
27
+ ],
28
+ "begin_suppress_tokens": [
29
+ 220,
30
+ 50257
31
+ ],
32
+ "bos_token_id": 50257,
33
+ "decoder_start_token_id": 50258,
34
+ "eos_token_id": 50257,
35
+ "forced_decoder_ids": [
36
+ [
37
+ 1,
38
+ null
39
+ ],
40
+ [
41
+ 2,
42
+ 50359
43
+ ]
44
+ ],
45
+ "is_multilingual": true,
46
+ "lang_to_id": {
47
+ "<|af|>": 50327,
48
+ "<|am|>": 50334,
49
+ "<|ar|>": 50272,
50
+ "<|as|>": 50350,
51
+ "<|az|>": 50304,
52
+ "<|ba|>": 50355,
53
+ "<|be|>": 50330,
54
+ "<|bg|>": 50292,
55
+ "<|bn|>": 50302,
56
+ "<|bo|>": 50347,
57
+ "<|br|>": 50309,
58
+ "<|bs|>": 50315,
59
+ "<|ca|>": 50270,
60
+ "<|cs|>": 50283,
61
+ "<|cy|>": 50297,
62
+ "<|da|>": 50285,
63
+ "<|de|>": 50261,
64
+ "<|el|>": 50281,
65
+ "<|en|>": 50259,
66
+ "<|es|>": 50262,
67
+ "<|et|>": 50307,
68
+ "<|eu|>": 50310,
69
+ "<|fa|>": 50300,
70
+ "<|fi|>": 50277,
71
+ "<|fo|>": 50338,
72
+ "<|fr|>": 50265,
73
+ "<|gl|>": 50319,
74
+ "<|gu|>": 50333,
75
+ "<|haw|>": 50352,
76
+ "<|ha|>": 50354,
77
+ "<|he|>": 50279,
78
+ "<|hi|>": 50276,
79
+ "<|hr|>": 50291,
80
+ "<|ht|>": 50339,
81
+ "<|hu|>": 50286,
82
+ "<|hy|>": 50312,
83
+ "<|id|>": 50275,
84
+ "<|is|>": 50311,
85
+ "<|it|>": 50274,
86
+ "<|ja|>": 50266,
87
+ "<|jw|>": 50356,
88
+ "<|ka|>": 50329,
89
+ "<|kk|>": 50316,
90
+ "<|km|>": 50323,
91
+ "<|kn|>": 50306,
92
+ "<|ko|>": 50264,
93
+ "<|la|>": 50294,
94
+ "<|lb|>": 50345,
95
+ "<|ln|>": 50353,
96
+ "<|lo|>": 50336,
97
+ "<|lt|>": 50293,
98
+ "<|lv|>": 50301,
99
+ "<|mg|>": 50349,
100
+ "<|mi|>": 50295,
101
+ "<|mk|>": 50308,
102
+ "<|ml|>": 50296,
103
+ "<|mn|>": 50314,
104
+ "<|mr|>": 50320,
105
+ "<|ms|>": 50282,
106
+ "<|mt|>": 50343,
107
+ "<|my|>": 50346,
108
+ "<|ne|>": 50313,
109
+ "<|nl|>": 50271,
110
+ "<|nn|>": 50342,
111
+ "<|no|>": 50288,
112
+ "<|oc|>": 50328,
113
+ "<|pa|>": 50321,
114
+ "<|pl|>": 50269,
115
+ "<|ps|>": 50340,
116
+ "<|pt|>": 50267,
117
+ "<|ro|>": 50284,
118
+ "<|ru|>": 50263,
119
+ "<|sa|>": 50344,
120
+ "<|sd|>": 50332,
121
+ "<|si|>": 50322,
122
+ "<|sk|>": 50298,
123
+ "<|sl|>": 50305,
124
+ "<|sn|>": 50324,
125
+ "<|so|>": 50326,
126
+ "<|sq|>": 50317,
127
+ "<|sr|>": 50303,
128
+ "<|su|>": 50357,
129
+ "<|sv|>": 50273,
130
+ "<|sw|>": 50318,
131
+ "<|ta|>": 50287,
132
+ "<|te|>": 50299,
133
+ "<|tg|>": 50331,
134
+ "<|th|>": 50289,
135
+ "<|tk|>": 50341,
136
+ "<|tl|>": 50348,
137
+ "<|tr|>": 50268,
138
+ "<|tt|>": 50351,
139
+ "<|uk|>": 50280,
140
+ "<|ur|>": 50290,
141
+ "<|uz|>": 50337,
142
+ "<|vi|>": 50278,
143
+ "<|yi|>": 50335,
144
+ "<|yo|>": 50325,
145
+ "<|zh|>": 50260
146
+ },
147
+ "max_initial_timestamp_index": 50,
148
+ "max_length": 448,
149
+ "no_timestamps_token_id": 50363,
150
+ "pad_token_id": 50257,
151
+ "prev_sot_token_id": 50361,
152
+ "return_timestamps": false,
153
+ "suppress_tokens": [
154
+ 1,
155
+ 2,
156
+ 7,
157
+ 8,
158
+ 9,
159
+ 10,
160
+ 14,
161
+ 25,
162
+ 26,
163
+ 27,
164
+ 28,
165
+ 29,
166
+ 31,
167
+ 58,
168
+ 59,
169
+ 60,
170
+ 61,
171
+ 62,
172
+ 63,
173
+ 90,
174
+ 91,
175
+ 92,
176
+ 93,
177
+ 359,
178
+ 503,
179
+ 522,
180
+ 542,
181
+ 873,
182
+ 893,
183
+ 902,
184
+ 918,
185
+ 922,
186
+ 931,
187
+ 1350,
188
+ 1853,
189
+ 1982,
190
+ 2460,
191
+ 2627,
192
+ 3246,
193
+ 3253,
194
+ 3268,
195
+ 3536,
196
+ 3846,
197
+ 3961,
198
+ 4183,
199
+ 4667,
200
+ 6585,
201
+ 6647,
202
+ 7273,
203
+ 9061,
204
+ 9383,
205
+ 10428,
206
+ 10929,
207
+ 11938,
208
+ 12033,
209
+ 12331,
210
+ 12562,
211
+ 13793,
212
+ 14157,
213
+ 14635,
214
+ 15265,
215
+ 15618,
216
+ 16553,
217
+ 16604,
218
+ 18362,
219
+ 18956,
220
+ 20075,
221
+ 21675,
222
+ 22520,
223
+ 26130,
224
+ 26161,
225
+ 26435,
226
+ 28279,
227
+ 29464,
228
+ 31650,
229
+ 32302,
230
+ 32470,
231
+ 36865,
232
+ 42863,
233
+ 47425,
234
+ 49870,
235
+ 50254,
236
+ 50258,
237
+ 50358,
238
+ 50359,
239
+ 50360,
240
+ 50361,
241
+ 50362
242
+ ],
243
+ "task_to_id": {
244
+ "transcribe": 50359,
245
+ "translate": 50358
246
+ },
247
+ "transformers_version": "4.41.2"
248
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:11ba410a96b84b60c0b1e08c629f0592c195d5620ade43c2acb9dfc402286458
3
  size 151109288
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:47083b6bc0c32c7eae0a3f2c737da92f7b17965b37dcc12e60724d92e88bf25c
3
  size 151109288
runs/Jul10_14-39-42_b9e0e4d4ca6a/events.out.tfevents.1720622383.b9e0e4d4ca6a.1.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dbc4d5245fd32aa53d6e64c4e972f3b52de42bcf2392de22ef3483035cb98826
3
- size 4330802
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1dafdf6d97a193e413d60dfe14dbd2233860362b3e153bc9443b08b4dfe7b2e2
3
+ size 4419417