Kulynych commited on
Commit
494579f
·
verified ·
1 Parent(s): 6edc4c3

End of training

Browse files
Files changed (2) hide show
  1. README.md +12 -12
  2. training_args.bin +1 -1
README.md CHANGED
@@ -16,14 +16,14 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.1515
20
- - Score: 3.5793
21
  - Counts: [1132, 692, 368, 143]
22
- - Totals: [1609, 1196, 784, 381]
23
  - Precisions: [70.35425730267247, 57.85953177257525, 46.93877551020408, 37.53280839895013]
24
- - Bp: 0.0692
25
- - Sys Len: 1609
26
  - Ref Len: 5907
 
 
 
27
 
28
  ## Model description
29
 
@@ -43,8 +43,8 @@ More information needed
43
 
44
  The following hyperparameters were used during training:
45
  - learning_rate: 2e-05
46
- - train_batch_size: 8
47
- - eval_batch_size: 8
48
  - seed: 42
49
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
50
  - lr_scheduler_type: linear
@@ -52,11 +52,11 @@ The following hyperparameters were used during training:
52
 
53
  ### Training results
54
 
55
- | Training Loss | Epoch | Step | Validation Loss | Score | Counts | Totals | Precisions | Bp | Sys Len | Ref Len |
56
- |:-------------:|:-----:|:----:|:---------------:|:------:|:---------------------:|:----------------------:|:------------------------------------------------------------------------------:|:------:|:-------:|:-------:|
57
- | 0.1836 | 1.0 | 464 | 0.1625 | 3.5827 | [1132, 692, 368, 143] | [1610, 1197, 785, 382] | [70.31055900621118, 57.811194653299914, 46.87898089171974, 37.43455497382199] | 0.0693 | 1610 | 5907 |
58
- | 0.1712 | 2.0 | 928 | 0.1545 | 3.6109 | [1136, 696, 371, 145] | [1610, 1197, 785, 382] | [70.55900621118012, 58.145363408521305, 47.261146496815286, 37.95811518324607] | 0.0693 | 1610 | 5907 |
59
- | 0.1626 | 3.0 | 1392 | 0.1515 | 3.5793 | [1132, 692, 368, 143] | [1609, 1196, 784, 381] | [70.35425730267247, 57.85953177257525, 46.93877551020408, 37.53280839895013] | 0.0692 | 1609 | 5907 |
60
 
61
 
62
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Bp: 0.0692
 
20
  - Counts: [1132, 692, 368, 143]
21
+ - Loss: 0.1515
22
  - Precisions: [70.35425730267247, 57.85953177257525, 46.93877551020408, 37.53280839895013]
 
 
23
  - Ref Len: 5907
24
+ - Score: 3.5793
25
+ - Sys Len: 1609
26
+ - Totals: [1609, 1196, 784, 381]
27
 
28
  ## Model description
29
 
 
43
 
44
  The following hyperparameters were used during training:
45
  - learning_rate: 2e-05
46
+ - train_batch_size: 12
47
+ - eval_batch_size: 12
48
  - seed: 42
49
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
50
  - lr_scheduler_type: linear
 
52
 
53
  ### Training results
54
 
55
+ | Training Loss | Epoch | Step | Bp | Counts | Validation Loss | Precisions | Ref Len | Score | Sys Len | Totals |
56
+ |:-------------:|:-----:|:----:|:------:|:---------------------:|:---------------:|:------------------------------------------------------------------------------:|:-------:|:------:|:-------:|:----------------------:|
57
+ | 0.1836 | 1.0 | 464 | 0.0693 | [1132, 692, 368, 143] | 0.1625 | [70.31055900621118, 57.811194653299914, 46.87898089171974, 37.43455497382199] | 5907 | 3.5827 | 1610 | [1610, 1197, 785, 382] |
58
+ | 0.1712 | 2.0 | 928 | 0.0693 | [1136, 696, 371, 145] | 0.1545 | [70.55900621118012, 58.145363408521305, 47.261146496815286, 37.95811518324607] | 5907 | 3.6109 | 1610 | [1610, 1197, 785, 382] |
59
+ | 0.1626 | 3.0 | 1392 | 0.0692 | [1132, 692, 368, 143] | 0.1515 | [70.35425730267247, 57.85953177257525, 46.93877551020408, 37.53280839895013] | 5907 | 3.5793 | 1609 | [1609, 1196, 784, 381] |
60
 
61
 
62
  ### Framework versions
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ed5f6bc38561d3e71689f3b19451e0110d972efd0d5b2e55aca4dd442ff6f189
3
  size 5304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a50f6c9e7c900ccfaf8ff56b09cc39eb240d4007334f63ea511c82f367ab7344
3
  size 5304