Padomin
/

t5-base-TEDxJP-2front-1body-2rear

+---
+license: cc-by-sa-4.0
+tags:
+- generated_from_trainer
+datasets:
+- te_dx_jp
+model-index:
+- name: t5-base-TEDxJP-2front-1body-2rear
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# t5-base-TEDxJP-2front-1body-2rear
+This model is a fine-tuned version of [sonoisa/t5-base-japanese](https://huggingface.co/sonoisa/t5-base-japanese) on the te_dx_jp dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.4507
+- Wer: 0.1735
+- Mer: 0.1673
+- Wil: 0.2540
+- Wip: 0.7460
+- Hits: 55769
+- Substitutions: 6394
+- Deletions: 2424
+- Insertions: 2390
+- Cer: 0.1370
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0001
+- train_batch_size: 32
+- eval_batch_size: 32
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 10
+### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Wer    | Mer    | Wil    | Wip    | Hits  | Substitutions | Deletions | Insertions | Cer    |
+|:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:------:|:-----:|:-------------:|:---------:|:----------:|:------:|
+| 0.6076        | 1.0   | 1457  | 0.4832          | 0.2119 | 0.1996 | 0.2902 | 0.7098 | 54866 | 6838          | 2883      | 3964       | 0.1884 |
+| 0.4983        | 2.0   | 2914  | 0.4324          | 0.1806 | 0.1743 | 0.2618 | 0.7382 | 55275 | 6452          | 2860      | 2354       | 0.1477 |
+| 0.4734        | 3.0   | 4371  | 0.4273          | 0.1781 | 0.1715 | 0.2592 | 0.7408 | 55577 | 6490          | 2520      | 2492       | 0.1396 |
+| 0.4068        | 4.0   | 5828  | 0.4265          | 0.1756 | 0.1692 | 0.2568 | 0.7432 | 55696 | 6477          | 2414      | 2449       | 0.1380 |
+| 0.378         | 5.0   | 7285  | 0.4281          | 0.1764 | 0.1697 | 0.2567 | 0.7433 | 55736 | 6436          | 2415      | 2539       | 0.1391 |
+| 0.3395        | 6.0   | 8742  | 0.4337          | 0.1737 | 0.1675 | 0.2540 | 0.7460 | 55744 | 6376          | 2467      | 2374       | 0.1375 |
+| 0.3282        | 7.0   | 10199 | 0.4388          | 0.1735 | 0.1675 | 0.2542 | 0.7458 | 55711 | 6396          | 2480      | 2331       | 0.1371 |
+| 0.3371        | 8.0   | 11656 | 0.4443          | 0.1736 | 0.1674 | 0.2542 | 0.7458 | 55753 | 6403          | 2431      | 2377       | 0.1368 |
+| 0.307         | 9.0   | 13113 | 0.4470          | 0.1725 | 0.1665 | 0.2528 | 0.7472 | 55770 | 6360          | 2457      | 2323       | 0.1361 |
+| 0.2842        | 10.0  | 14570 | 0.4507          | 0.1735 | 0.1673 | 0.2540 | 0.7460 | 55769 | 6394          | 2424      | 2390       | 0.1370 |
+### Framework versions
+- Transformers 4.21.2
+- Pytorch 1.12.1+cu116
+- Datasets 2.4.0
+- Tokenizers 0.12.1