pakawadeep commited on
Commit
7939180
·
1 Parent(s): 0640a42

Training in progress epoch 27

Browse files
README.md CHANGED
@@ -15,14 +15,14 @@ probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [google/mt5-large](https://huggingface.co/google/mt5-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 0.8685
19
- - Validation Loss: 0.8795
20
- - Train Rouge1: 9.0759
21
- - Train Rouge2: 2.4257
22
  - Train Rougel: 8.9109
23
  - Train Rougelsum: 8.9109
24
- - Train Gen Len: 11.8861
25
- - Epoch: 24
26
 
27
  ## Model description
28
 
@@ -73,6 +73,9 @@ The following hyperparameters were used during training:
73
  | 0.9636 | 0.9317 | 8.5809 | 1.9307 | 8.4158 | 8.4512 | 11.8416 | 22 |
74
  | 0.9054 | 0.8921 | 8.5809 | 1.9307 | 8.4158 | 8.4512 | 11.8663 | 23 |
75
  | 0.8685 | 0.8795 | 9.0759 | 2.4257 | 8.9109 | 8.9109 | 11.8861 | 24 |
 
 
 
76
 
77
 
78
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [google/mt5-large](https://huggingface.co/google/mt5-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Train Loss: 0.7379
19
+ - Validation Loss: 0.8383
20
+ - Train Rouge1: 8.9816
21
+ - Train Rouge2: 2.3762
22
  - Train Rougel: 8.9109
23
  - Train Rougelsum: 8.9109
24
+ - Train Gen Len: 11.9752
25
+ - Epoch: 27
26
 
27
  ## Model description
28
 
 
73
  | 0.9636 | 0.9317 | 8.5809 | 1.9307 | 8.4158 | 8.4512 | 11.8416 | 22 |
74
  | 0.9054 | 0.8921 | 8.5809 | 1.9307 | 8.4158 | 8.4512 | 11.8663 | 23 |
75
  | 0.8685 | 0.8795 | 9.0759 | 2.4257 | 8.9109 | 8.9109 | 11.8861 | 24 |
76
+ | 0.8100 | 0.8666 | 8.9816 | 2.3762 | 8.9109 | 8.9109 | 11.9455 | 25 |
77
+ | 0.7749 | 0.8524 | 8.9816 | 2.3762 | 8.9109 | 8.9109 | 11.9505 | 26 |
78
+ | 0.7379 | 0.8383 | 8.9816 | 2.3762 | 8.9109 | 8.9109 | 11.9752 | 27 |
79
 
80
 
81
  ### Framework versions
logs/train/events.out.tfevents.1710275260.c144174c8023.1261.0.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1835294dafd14929b626aa0bc48a128a7cb94fc091ad3ac694de1511431fec7d
3
- size 13288958
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d57d5a141ea56b4652d22e74a7d0b5933ad0e9f31aa21453556918df52206aa0
3
+ size 13290224
logs/validation/events.out.tfevents.1710275622.c144174c8023.1261.1.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d3711e91776113999c93024c8aa9ba9db95f9280ae7f1dd8cd9a51e14a19171d
3
- size 3976
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e4e1b6c3d779b84c41524428ce398c6e0f144d4dae371c0a92d8662050cf0f2c
3
+ size 4444
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:428c1fd0af017cdb490ddd341ac7c6554a92cb2e00877aa01d1478e25c5f7b0e
3
  size 6968370776
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6a52917bf58af805496e86c633b57a7b026f2c799ed75aa2aca2e71997b0da1d
3
  size 6968370776