End of training
Browse files- README.md +12 -12
- training_args.bin +1 -1
README.md
CHANGED
@@ -16,14 +16,14 @@ should probably proofread and complete it, then remove this comment. -->
|
|
16 |
|
17 |
This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
|
18 |
It achieves the following results on the evaluation set:
|
19 |
-
-
|
20 |
-
- Score: 3.5793
|
21 |
- Counts: [1132, 692, 368, 143]
|
22 |
-
-
|
23 |
- Precisions: [70.35425730267247, 57.85953177257525, 46.93877551020408, 37.53280839895013]
|
24 |
-
- Bp: 0.0692
|
25 |
-
- Sys Len: 1609
|
26 |
- Ref Len: 5907
|
|
|
|
|
|
|
27 |
|
28 |
## Model description
|
29 |
|
@@ -43,8 +43,8 @@ More information needed
|
|
43 |
|
44 |
The following hyperparameters were used during training:
|
45 |
- learning_rate: 2e-05
|
46 |
-
- train_batch_size:
|
47 |
-
- eval_batch_size:
|
48 |
- seed: 42
|
49 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
50 |
- lr_scheduler_type: linear
|
@@ -52,11 +52,11 @@ The following hyperparameters were used during training:
|
|
52 |
|
53 |
### Training results
|
54 |
|
55 |
-
| Training Loss | Epoch | Step |
|
56 |
-
|
57 |
-
| 0.1836 | 1.0 | 464 | 0.
|
58 |
-
| 0.1712 | 2.0 | 928 | 0.
|
59 |
-
| 0.1626 | 3.0 | 1392 | 0.
|
60 |
|
61 |
|
62 |
### Framework versions
|
|
|
16 |
|
17 |
This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
|
18 |
It achieves the following results on the evaluation set:
|
19 |
+
- Bp: 0.0692
|
|
|
20 |
- Counts: [1132, 692, 368, 143]
|
21 |
+
- Loss: 0.1515
|
22 |
- Precisions: [70.35425730267247, 57.85953177257525, 46.93877551020408, 37.53280839895013]
|
|
|
|
|
23 |
- Ref Len: 5907
|
24 |
+
- Score: 3.5793
|
25 |
+
- Sys Len: 1609
|
26 |
+
- Totals: [1609, 1196, 784, 381]
|
27 |
|
28 |
## Model description
|
29 |
|
|
|
43 |
|
44 |
The following hyperparameters were used during training:
|
45 |
- learning_rate: 2e-05
|
46 |
+
- train_batch_size: 12
|
47 |
+
- eval_batch_size: 12
|
48 |
- seed: 42
|
49 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
50 |
- lr_scheduler_type: linear
|
|
|
52 |
|
53 |
### Training results
|
54 |
|
55 |
+
| Training Loss | Epoch | Step | Bp | Counts | Validation Loss | Precisions | Ref Len | Score | Sys Len | Totals |
|
56 |
+
|:-------------:|:-----:|:----:|:------:|:---------------------:|:---------------:|:------------------------------------------------------------------------------:|:-------:|:------:|:-------:|:----------------------:|
|
57 |
+
| 0.1836 | 1.0 | 464 | 0.0693 | [1132, 692, 368, 143] | 0.1625 | [70.31055900621118, 57.811194653299914, 46.87898089171974, 37.43455497382199] | 5907 | 3.5827 | 1610 | [1610, 1197, 785, 382] |
|
58 |
+
| 0.1712 | 2.0 | 928 | 0.0693 | [1136, 696, 371, 145] | 0.1545 | [70.55900621118012, 58.145363408521305, 47.261146496815286, 37.95811518324607] | 5907 | 3.6109 | 1610 | [1610, 1197, 785, 382] |
|
59 |
+
| 0.1626 | 3.0 | 1392 | 0.0692 | [1132, 692, 368, 143] | 0.1515 | [70.35425730267247, 57.85953177257525, 46.93877551020408, 37.53280839895013] | 5907 | 3.5793 | 1609 | [1609, 1196, 784, 381] |
|
60 |
|
61 |
|
62 |
### Framework versions
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5304
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a50f6c9e7c900ccfaf8ff56b09cc39eb240d4007334f63ea511c82f367ab7344
|
3 |
size 5304
|