Training complete
Browse files
README.md
CHANGED
@@ -5,6 +5,8 @@ base_model: google/flan-t5-base
|
|
5 |
tags:
|
6 |
- simplification
|
7 |
- generated_from_trainer
|
|
|
|
|
8 |
model-index:
|
9 |
- name: flan-t5-base-lecturaFacil2
|
10 |
results: []
|
@@ -16,6 +18,12 @@ should probably proofread and complete it, then remove this comment. -->
|
|
16 |
# flan-t5-base-lecturaFacil2
|
17 |
|
18 |
This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset.
|
|
|
|
|
|
|
|
|
|
|
|
|
19 |
|
20 |
## Model description
|
21 |
|
@@ -42,6 +50,42 @@ The following hyperparameters were used during training:
|
|
42 |
- lr_scheduler_type: linear
|
43 |
- num_epochs: 30
|
44 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
45 |
### Framework versions
|
46 |
|
47 |
- Transformers 4.45.1
|
|
|
5 |
tags:
|
6 |
- simplification
|
7 |
- generated_from_trainer
|
8 |
+
metrics:
|
9 |
+
- rouge
|
10 |
model-index:
|
11 |
- name: flan-t5-base-lecturaFacil2
|
12 |
results: []
|
|
|
18 |
# flan-t5-base-lecturaFacil2
|
19 |
|
20 |
This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset.
|
21 |
+
It achieves the following results on the evaluation set:
|
22 |
+
- Loss: 0.9248
|
23 |
+
- Rouge1: 7.2543
|
24 |
+
- Rouge2: 4.93
|
25 |
+
- Rougel: 6.8455
|
26 |
+
- Rougelsum: 7.1492
|
27 |
|
28 |
## Model description
|
29 |
|
|
|
50 |
- lr_scheduler_type: linear
|
51 |
- num_epochs: 30
|
52 |
|
53 |
+
### Training results
|
54 |
+
|
55 |
+
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|
56 |
+
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
|
57 |
+
| No log | 1.0 | 126 | 1.0730 | 7.2577 | 5.0999 | 6.93 | 7.1434 |
|
58 |
+
| 1.2321 | 2.0 | 252 | 1.0311 | 7.1992 | 4.8058 | 6.8141 | 7.0763 |
|
59 |
+
| 1.2321 | 3.0 | 378 | 1.0080 | 7.3086 | 5.0542 | 6.9555 | 7.1842 |
|
60 |
+
| 1.0716 | 4.0 | 504 | 0.9922 | 7.4007 | 5.103 | 7.0486 | 7.2928 |
|
61 |
+
| 1.0716 | 5.0 | 630 | 0.9780 | 7.371 | 4.8914 | 6.9432 | 7.2608 |
|
62 |
+
| 1.0098 | 6.0 | 756 | 0.9741 | 7.437 | 4.9959 | 6.9879 | 7.3239 |
|
63 |
+
| 1.0098 | 7.0 | 882 | 0.9598 | 7.4599 | 5.0313 | 7.0512 | 7.3389 |
|
64 |
+
| 0.9732 | 8.0 | 1008 | 0.9569 | 7.3745 | 4.9056 | 6.9218 | 7.2723 |
|
65 |
+
| 0.9732 | 9.0 | 1134 | 0.9509 | 7.4509 | 5.0068 | 6.977 | 7.3181 |
|
66 |
+
| 0.9362 | 10.0 | 1260 | 0.9453 | 7.315 | 4.9174 | 6.8758 | 7.2008 |
|
67 |
+
| 0.9362 | 11.0 | 1386 | 0.9387 | 7.3426 | 4.967 | 6.9522 | 7.2339 |
|
68 |
+
| 0.9081 | 12.0 | 1512 | 0.9407 | 7.4108 | 5.0262 | 6.9448 | 7.2856 |
|
69 |
+
| 0.9081 | 13.0 | 1638 | 0.9364 | 7.3558 | 4.9637 | 6.9302 | 7.2288 |
|
70 |
+
| 0.8894 | 14.0 | 1764 | 0.9339 | 7.3624 | 4.9192 | 6.923 | 7.2278 |
|
71 |
+
| 0.8894 | 15.0 | 1890 | 0.9320 | 7.3651 | 4.9105 | 6.9139 | 7.2334 |
|
72 |
+
| 0.866 | 16.0 | 2016 | 0.9311 | 7.4031 | 4.9454 | 6.9449 | 7.2774 |
|
73 |
+
| 0.866 | 17.0 | 2142 | 0.9328 | 7.3089 | 4.927 | 6.8732 | 7.1792 |
|
74 |
+
| 0.8521 | 18.0 | 2268 | 0.9271 | 7.287 | 4.9457 | 6.8903 | 7.1785 |
|
75 |
+
| 0.8521 | 19.0 | 2394 | 0.9268 | 7.2971 | 4.9465 | 6.8599 | 7.1433 |
|
76 |
+
| 0.8292 | 20.0 | 2520 | 0.9280 | 7.3163 | 4.9526 | 6.8963 | 7.1913 |
|
77 |
+
| 0.8292 | 21.0 | 2646 | 0.9280 | 7.2896 | 4.9634 | 6.8796 | 7.1637 |
|
78 |
+
| 0.8278 | 22.0 | 2772 | 0.9261 | 7.3053 | 4.9904 | 6.8909 | 7.1665 |
|
79 |
+
| 0.8278 | 23.0 | 2898 | 0.9261 | 7.2905 | 4.9755 | 6.8948 | 7.1735 |
|
80 |
+
| 0.8157 | 24.0 | 3024 | 0.9250 | 7.2759 | 4.9718 | 6.8659 | 7.1682 |
|
81 |
+
| 0.8157 | 25.0 | 3150 | 0.9256 | 7.2898 | 4.9395 | 6.8631 | 7.1709 |
|
82 |
+
| 0.8052 | 26.0 | 3276 | 0.9244 | 7.2833 | 4.9821 | 6.8806 | 7.1818 |
|
83 |
+
| 0.8052 | 27.0 | 3402 | 0.9240 | 7.2708 | 4.9682 | 6.8609 | 7.1558 |
|
84 |
+
| 0.8055 | 28.0 | 3528 | 0.9250 | 7.2671 | 4.9469 | 6.8412 | 7.1365 |
|
85 |
+
| 0.8055 | 29.0 | 3654 | 0.9242 | 7.264 | 4.9601 | 6.8631 | 7.1534 |
|
86 |
+
| 0.8016 | 30.0 | 3780 | 0.9248 | 7.2543 | 4.93 | 6.8455 | 7.1492 |
|
87 |
+
|
88 |
+
|
89 |
### Framework versions
|
90 |
|
91 |
- Transformers 4.45.1
|
runs/Feb21_09-27-51_minion/events.out.tfevents.1740126477.minion
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a08882508d94cfbbf6386750fc2859ff29f2b9a9af1615beba6123db3e4066c4
|
3 |
+
size 23678
|