eglkan1
/

mBART-TextSimp-LT-BatchSize2-lr1e-4

Text2Text Generation

Generated from Trainer

Model card Files Files and versions Community

eglkan1 commited on Apr 11, 2024

Commit

8a7d9c8

·

verified ·

1 Parent(s): 4fa440c

End of training

Files changed (1) hide show

README.md +19 -2

README.md CHANGED Viewed

@@ -3,6 +3,9 @@ license: mit
 base_model: facebook/mbart-large-50
 tags:
 - generated_from_trainer
 model-index:
 - name: mBART-TextSimp-LT-BatchSize4-lr1e-4
   results: []
@@ -14,6 +17,13 @@ should probably proofread and complete it, then remove this comment. -->
 # mBART-TextSimp-LT-BatchSize4-lr1e-4
 This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on the None dataset.
 ## Model description
@@ -41,13 +51,20 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Sacrebleu | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| No log        | 1.0   | 418  | 0.0891          | 0.6619 | 0.4917 | 0.6516 | 38.2708   | 34.2792 |
 ### Framework versions

 base_model: facebook/mbart-large-50
 tags:
 - generated_from_trainer
+metrics:
+- rouge
+- sacrebleu
 model-index:
 - name: mBART-TextSimp-LT-BatchSize4-lr1e-4
   results: []
 # mBART-TextSimp-LT-BatchSize4-lr1e-4
 This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0962
+- Rouge1: 0.76
+- Rouge2: 0.6246
+- Rougel: 0.7508
+- Sacrebleu: 53.9078
+- Gen Len: 32.9976
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 8
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Sacrebleu | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| 0.0639        | 1.0   | 418  | 0.0779          | 0.7012 | 0.5432 | 0.6904 | 43.0798   | 32.9976 |
+| 0.0653        | 2.0   | 836  | 0.0732          | 0.7197 | 0.5593 | 0.7091 | 44.8483   | 32.9976 |
+| 0.0327        | 3.0   | 1254 | 0.0726          | 0.7319 | 0.5787 | 0.7206 | 47.842    | 32.9976 |
+| 0.0168        | 4.0   | 1672 | 0.0782          | 0.7466 | 0.6031 | 0.7371 | 50.9225   | 32.9976 |
+| 0.013         | 5.0   | 2090 | 0.0804          | 0.7507 | 0.6077 | 0.7409 | 51.8293   | 32.9976 |
+| 0.0032        | 6.0   | 2508 | 0.0846          | 0.7606 | 0.6237 | 0.7507 | 53.5224   | 32.9976 |
+| 0.0012        | 7.0   | 2926 | 0.0911          | 0.7597 | 0.6263 | 0.751  | 54.0182   | 32.9976 |
+| 0.0012        | 8.0   | 3344 | 0.0962          | 0.76   | 0.6246 | 0.7508 | 53.9078   | 32.9976 |
 ### Framework versions