Kulynych
/

version_1305

Transformers

Safetensors

text2text-generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

Kulynych commited on May 14

Commit

494579f

verified ·

1 Parent(s): 6edc4c3

End of training

Browse files

Files changed (2) hide show

README.md +12 -12
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -16,14 +16,14 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1515
-- Score: 3.5793
 - Counts: [1132, 692, 368, 143]
-- Totals: [1609, 1196, 784, 381]
 - Precisions: [70.35425730267247, 57.85953177257525, 46.93877551020408, 37.53280839895013]
-- Bp: 0.0692
-- Sys Len: 1609
 - Ref Len: 5907
 ## Model description
@@ -43,8 +43,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -52,11 +52,11 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Score  | Counts                | Totals                 | Precisions                                                                     | Bp     | Sys Len | Ref Len |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:---------------------:|:----------------------:|:------------------------------------------------------------------------------:|:------:|:-------:|:-------:|
-| 0.1836        | 1.0   | 464  | 0.1625          | 3.5827 | [1132, 692, 368, 143] | [1610, 1197, 785, 382] | [70.31055900621118, 57.811194653299914, 46.87898089171974, 37.43455497382199]  | 0.0693 | 1610    | 5907    |
-| 0.1712        | 2.0   | 928  | 0.1545          | 3.6109 | [1136, 696, 371, 145] | [1610, 1197, 785, 382] | [70.55900621118012, 58.145363408521305, 47.261146496815286, 37.95811518324607] | 0.0693 | 1610    | 5907    |
-| 0.1626        | 3.0   | 1392 | 0.1515          | 3.5793 | [1132, 692, 368, 143] | [1609, 1196, 784, 381] | [70.35425730267247, 57.85953177257525, 46.93877551020408, 37.53280839895013]   | 0.0692 | 1609    | 5907    |
 ### Framework versions

 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Bp: 0.0692
 - Counts: [1132, 692, 368, 143]
+- Loss: 0.1515
 - Precisions: [70.35425730267247, 57.85953177257525, 46.93877551020408, 37.53280839895013]
 - Ref Len: 5907
+- Score: 3.5793
+- Sys Len: 1609
+- Totals: [1609, 1196, 784, 381]
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 12
+- eval_batch_size: 12
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 ### Training results
+| Training Loss | Epoch | Step | Bp     | Counts                | Validation Loss | Precisions                                                                     | Ref Len | Score  | Sys Len | Totals                 |
+|:-------------:|:-----:|:----:|:------:|:---------------------:|:---------------:|:------------------------------------------------------------------------------:|:-------:|:------:|:-------:|:----------------------:|
+| 0.1836        | 1.0   | 464  | 0.0693 | [1132, 692, 368, 143] | 0.1625          | [70.31055900621118, 57.811194653299914, 46.87898089171974, 37.43455497382199]  | 5907    | 3.5827 | 1610    | [1610, 1197, 785, 382] |
+| 0.1712        | 2.0   | 928  | 0.0693 | [1136, 696, 371, 145] | 0.1545          | [70.55900621118012, 58.145363408521305, 47.261146496815286, 37.95811518324607] | 5907    | 3.6109 | 1610    | [1610, 1197, 785, 382] |
+| 0.1626        | 3.0   | 1392 | 0.0692 | [1132, 692, 368, 143] | 0.1515          | [70.35425730267247, 57.85953177257525, 46.93877551020408, 37.53280839895013]   | 5907    | 3.5793 | 1609    | [1609, 1196, 784, 381] |
 ### Framework versions

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ed5f6bc38561d3e71689f3b19451e0110d972efd0d5b2e55aca4dd442ff6f189
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:a50f6c9e7c900ccfaf8ff56b09cc39eb240d4007334f63ea511c82f367ab7344
 size 5304