Padomin committed
Commit 0ab4d35 · 1 Parent(s): 0390a66

update model card README.md

Files changed (1)
  1. README.md +21 -21
README.md CHANGED
@@ -16,16 +16,16 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [sonoisa/t5-base-japanese](https://huggingface.co/sonoisa/t5-base-japanese) on the te_dx_jp dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.4743
- - Wer: 0.1770
- - Mer: 0.1709
- - Wil: 0.2594
- - Wip: 0.7406
- - Hits: 55458
- - Substitutions: 6535
- - Deletions: 2594
- - Insertions: 2305
- - Cer: 0.1377
+ - Loss: 0.4714
+ - Wer: 0.1751
+ - Mer: 0.1694
+ - Wil: 0.2572
+ - Wip: 0.7428
+ - Hits: 55476
+ - Substitutions: 6473
+ - Deletions: 2638
+ - Insertions: 2201
+ - Cer: 0.1381
 
  ## Model description
 
@@ -47,7 +47,7 @@ The following hyperparameters were used during training:
  - learning_rate: 0.0001
  - train_batch_size: 32
  - eval_batch_size: 32
- - seed: 20
+ - seed: 30
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_ratio: 0.1
@@ -57,16 +57,16 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Wer | Mer | Wil | Wip | Hits | Substitutions | Deletions | Insertions | Cer |
  |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:------:|:-----:|:-------------:|:---------:|:----------:|:------:|
- | 0.6274 | 1.0 | 1457 | 0.4935 | 0.2117 | 0.1996 | 0.2903 | 0.7097 | 54836 | 6843 | 2908 | 3921 | 0.1780 |
- | 0.565 | 2.0 | 2914 | 0.4529 | 0.1796 | 0.1740 | 0.2620 | 0.7380 | 55058 | 6472 | 3057 | 2070 | 0.1454 |
- | 0.528 | 3.0 | 4371 | 0.4533 | 0.1822 | 0.1752 | 0.2634 | 0.7366 | 55400 | 6528 | 2659 | 2582 | 0.1421 |
- | 0.4545 | 4.0 | 5828 | 0.4457 | 0.1750 | 0.1695 | 0.2569 | 0.7431 | 55392 | 6427 | 2768 | 2108 | 0.1358 |
- | 0.4455 | 5.0 | 7285 | 0.4468 | 0.1772 | 0.1711 | 0.2594 | 0.7406 | 55425 | 6517 | 2645 | 2281 | 0.1375 |
- | 0.3825 | 6.0 | 8742 | 0.4556 | 0.1755 | 0.1697 | 0.2576 | 0.7424 | 55486 | 6482 | 2619 | 2237 | 0.1368 |
- | 0.3399 | 7.0 | 10199 | 0.4581 | 0.1765 | 0.1706 | 0.2587 | 0.7413 | 55451 | 6505 | 2631 | 2266 | 0.1382 |
- | 0.3217 | 8.0 | 11656 | 0.4631 | 0.1767 | 0.1707 | 0.2594 | 0.7406 | 55458 | 6555 | 2574 | 2283 | 0.1379 |
- | 0.3302 | 9.0 | 13113 | 0.4710 | 0.1768 | 0.1706 | 0.2591 | 0.7409 | 55505 | 6538 | 2544 | 2336 | 0.1373 |
- | 0.2801 | 10.0 | 14570 | 0.4743 | 0.1770 | 0.1709 | 0.2594 | 0.7406 | 55458 | 6535 | 2594 | 2305 | 0.1377 |
+ | 0.6116 | 1.0 | 1457 | 0.4923 | 0.2289 | 0.2127 | 0.3015 | 0.6985 | 54722 | 6733 | 3132 | 4917 | 0.1992 |
+ | 0.5362 | 2.0 | 2914 | 0.4506 | 0.1835 | 0.1770 | 0.2661 | 0.7339 | 55105 | 6590 | 2892 | 2369 | 0.1447 |
+ | 0.4869 | 3.0 | 4371 | 0.4459 | 0.1806 | 0.1742 | 0.2629 | 0.7371 | 55298 | 6556 | 2733 | 2374 | 0.1424 |
+ | 0.4642 | 4.0 | 5828 | 0.4413 | 0.1767 | 0.1710 | 0.2588 | 0.7412 | 55331 | 6462 | 2794 | 2157 | 0.1379 |
+ | 0.4395 | 5.0 | 7285 | 0.4462 | 0.1779 | 0.1719 | 0.2594 | 0.7406 | 55367 | 6451 | 2769 | 2270 | 0.1391 |
+ | 0.3831 | 6.0 | 8742 | 0.4493 | 0.1751 | 0.1696 | 0.2568 | 0.7432 | 55370 | 6409 | 2808 | 2092 | 0.1369 |
+ | 0.3446 | 7.0 | 10199 | 0.4563 | 0.1769 | 0.1710 | 0.2595 | 0.7405 | 55401 | 6535 | 2651 | 2238 | 0.1397 |
+ | 0.3031 | 8.0 | 11656 | 0.4657 | 0.1754 | 0.1697 | 0.2578 | 0.7422 | 55436 | 6492 | 2659 | 2179 | 0.1372 |
+ | 0.3406 | 9.0 | 13113 | 0.4677 | 0.1750 | 0.1692 | 0.2570 | 0.7430 | 55502 | 6474 | 2611 | 2219 | 0.1365 |
+ | 0.3067 | 10.0 | 14570 | 0.4714 | 0.1751 | 0.1694 | 0.2572 | 0.7428 | 55476 | 6473 | 2638 | 2201 | 0.1377 |
 
 
  ### Framework versions
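
Reviewer note: the Wer, Mer, Wil, and Wip values in this diff appear to follow from the Hits, Substitutions, Deletions, and Insertions counts. The sketch below assumes the standard alignment-based definitions (the ones the `jiwer` library documents); the model card does not say which tool produced the numbers, so treat that as an assumption. `EditCounts` and `error_rates` are hypothetical names introduced for illustration. Cer comes from a separate character-level alignment and cannot be recovered from these counts.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EditCounts:
    """Alignment counts as listed in the model card (hypothetical container)."""
    hits: int
    substitutions: int
    deletions: int
    insertions: int


def error_rates(c: EditCounts) -> dict[str, float]:
    """Derive WER, MER, WIP, and WIL from alignment counts.

    Assumes the standard definitions:
      WER = (S + D + I) / (H + S + D)
      MER = (S + D + I) / (H + S + D + I)
      WIP = (H / N_ref) * (H / N_hyp)
      WIL = 1 - WIP
    """
    ref_len = c.hits + c.substitutions + c.deletions    # tokens in the references
    hyp_len = c.hits + c.substitutions + c.insertions   # tokens in the hypotheses
    errors = c.substitutions + c.deletions + c.insertions

    wip = (c.hits / ref_len) * (c.hits / hyp_len)
    return {
        "wer": errors / ref_len,
        "mer": errors / (ref_len + c.insertions),
        "wip": wip,
        "wil": 1.0 - wip,
    }


# Final-epoch counts from the two versions of the README in this diff.
SEED_20 = EditCounts(hits=55458, substitutions=6535, deletions=2594, insertions=2305)
SEED_30 = EditCounts(hits=55476, substitutions=6473, deletions=2638, insertions=2201)

if __name__ == "__main__":
    for name, counts in (("seed 20", SEED_20), ("seed 30", SEED_30)):
        rates = error_rates(counts)
        print(name, {k: round(v, 4) for k, v in rates.items()})
```

Under these assumptions both runs yield the same reference length (Hits + Substitutions + Deletions = 64587), which is consistent with the two seeds being evaluated on the same set.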