tanoManzo commited on
Commit
10c051e
·
verified ·
1 Parent(s): 149db23

End of training

Browse files
Files changed (2) hide show
  1. README.md +40 -26
  2. model.safetensors +1 -1
README.md CHANGED
@@ -18,13 +18,13 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [AIRI-Institute/gena-lm-bert-base-t2t-multi](https://huggingface.co/AIRI-Institute/gena-lm-bert-base-t2t-multi) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.4693
22
- - F1 Score: 0.8489
23
- - Precision: 0.8151
24
- - Recall: 0.8855
25
- - Accuracy: 0.8355
26
- - Auc: 0.8880
27
- - Prc: 0.8543
28
 
29
  ## Model description
30
 
@@ -54,25 +54,39 @@ The following hyperparameters were used during training:
54
 
55
  ### Training results
56
 
57
- | Training Loss | Epoch | Step | Validation Loss | F1 Score | Precision | Recall | Accuracy | Auc | Prc |
58
- |:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:--------:|:------:|:------:|
59
- | 0.7045 | 0.2103 | 500 | 0.6716 | 0.3608 | 0.8731 | 0.2274 | 0.5797 | 0.7448 | 0.7630 |
60
- | 0.6504 | 0.4207 | 1000 | 0.6005 | 0.7373 | 0.7184 | 0.7573 | 0.7186 | 0.7845 | 0.7959 |
61
- | 0.5759 | 0.6310 | 1500 | 0.5370 | 0.7482 | 0.7872 | 0.7129 | 0.7497 | 0.7960 | 0.7969 |
62
- | 0.5182 | 0.8414 | 2000 | 0.5214 | 0.8035 | 0.7299 | 0.8935 | 0.7720 | 0.8064 | 0.7516 |
63
- | 0.4665 | 1.0517 | 2500 | 0.4835 | 0.8199 | 0.8304 | 0.8097 | 0.8145 | 0.8676 | 0.8310 |
64
- | 0.463 | 1.2621 | 3000 | 0.4728 | 0.8318 | 0.7679 | 0.9073 | 0.8086 | 0.8709 | 0.8363 |
65
- | 0.441 | 1.4724 | 3500 | 0.4638 | 0.8316 | 0.8067 | 0.8581 | 0.8187 | 0.8770 | 0.8401 |
66
- | 0.4178 | 1.6828 | 4000 | 0.4333 | 0.8358 | 0.8040 | 0.8702 | 0.8216 | 0.8940 | 0.8833 |
67
- | 0.4165 | 1.8931 | 4500 | 0.4512 | 0.8387 | 0.8095 | 0.8702 | 0.8254 | 0.8851 | 0.8599 |
68
- | 0.4082 | 2.1035 | 5000 | 0.4773 | 0.8361 | 0.8288 | 0.8435 | 0.8275 | 0.8801 | 0.8592 |
69
- | 0.4006 | 2.3138 | 5500 | 0.4735 | 0.8453 | 0.8066 | 0.8879 | 0.8305 | 0.8766 | 0.8257 |
70
- | 0.4053 | 2.5242 | 6000 | 0.4654 | 0.8500 | 0.8033 | 0.9024 | 0.8338 | 0.8930 | 0.8661 |
71
- | 0.4101 | 2.7345 | 6500 | 0.4794 | 0.8493 | 0.8059 | 0.8976 | 0.8338 | 0.8637 | 0.8114 |
72
- | 0.4299 | 2.9449 | 7000 | 0.5050 | 0.8069 | 0.8732 | 0.75 | 0.8128 | 0.9019 | 0.8977 |
73
- | 0.3828 | 3.1552 | 7500 | 0.6362 | 0.7813 | 0.8957 | 0.6927 | 0.7976 | 0.8789 | 0.8808 |
74
- | 0.4132 | 3.3656 | 8000 | 0.4565 | 0.8484 | 0.8130 | 0.8871 | 0.8347 | 0.9009 | 0.8765 |
75
- | 0.383 | 3.5759 | 8500 | 0.4693 | 0.8489 | 0.8151 | 0.8855 | 0.8355 | 0.8880 | 0.8543 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
76
 
77
 
78
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [AIRI-Institute/gena-lm-bert-base-t2t-multi](https://huggingface.co/AIRI-Institute/gena-lm-bert-base-t2t-multi) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.7191
22
+ - F1 Score: 0.8258
23
+ - Precision: 0.8725
24
+ - Recall: 0.7839
25
+ - Accuracy: 0.8275
26
+ - Auc: 0.8676
27
+ - Prc: 0.8589
28
 
29
  ## Model description
30
 
 
54
 
55
  ### Training results
56
 
57
+ | Training Loss | Epoch | Step | Validation Loss | F1 Score | Precision | Recall | Accuracy | Auc | Prc |
58
+ |:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:--------:|:------:|:------:|
59
+ | 0.6927 | 0.2103 | 500 | 0.6550 | 0.6325 | 0.8027 | 0.5218 | 0.6836 | 0.7657 | 0.7801 |
60
+ | 0.6285 | 0.4207 | 1000 | 0.5540 | 0.7342 | 0.8129 | 0.6694 | 0.7472 | 0.8065 | 0.8058 |
61
+ | 0.5306 | 0.6310 | 1500 | 0.5262 | 0.7747 | 0.8010 | 0.75 | 0.7724 | 0.8369 | 0.8222 |
62
+ | 0.4969 | 0.8414 | 2000 | 0.4964 | 0.8208 | 0.7561 | 0.8976 | 0.7955 | 0.8721 | 0.8624 |
63
+ | 0.4722 | 1.0517 | 2500 | 0.4584 | 0.8228 | 0.8354 | 0.8105 | 0.8178 | 0.8876 | 0.8792 |
64
+ | 0.4466 | 1.2621 | 3000 | 0.4567 | 0.8424 | 0.7943 | 0.8968 | 0.8250 | 0.8896 | 0.8698 |
65
+ | 0.4418 | 1.4724 | 3500 | 0.4333 | 0.8416 | 0.8436 | 0.8395 | 0.8351 | 0.9004 | 0.8883 |
66
+ | 0.422 | 1.6828 | 4000 | 0.4661 | 0.8227 | 0.8588 | 0.7895 | 0.8225 | 0.9030 | 0.8967 |
67
+ | 0.4107 | 1.8931 | 4500 | 0.4329 | 0.8468 | 0.8009 | 0.8984 | 0.8305 | 0.8937 | 0.8585 |
68
+ | 0.3906 | 2.1035 | 5000 | 0.4643 | 0.8479 | 0.8290 | 0.8677 | 0.8376 | 0.8902 | 0.8512 |
69
+ | 0.4098 | 2.3138 | 5500 | 0.4532 | 0.8526 | 0.8060 | 0.9048 | 0.8368 | 0.8782 | 0.8309 |
70
+ | 0.4118 | 2.5242 | 6000 | 0.4862 | 0.8465 | 0.8503 | 0.8427 | 0.8406 | 0.9018 | 0.8845 |
71
+ | 0.4207 | 2.7345 | 6500 | 0.4667 | 0.8519 | 0.8126 | 0.8952 | 0.8376 | 0.8927 | 0.8561 |
72
+ | 0.4382 | 2.9449 | 7000 | 0.5130 | 0.8202 | 0.8763 | 0.7710 | 0.8237 | 0.9094 | 0.9039 |
73
+ | 0.3846 | 3.1552 | 7500 | 0.5103 | 0.8381 | 0.8659 | 0.8121 | 0.8363 | 0.9077 | 0.8992 |
74
+ | 0.4023 | 3.3656 | 8000 | 0.4508 | 0.8613 | 0.8225 | 0.9040 | 0.8481 | 0.9123 | 0.8963 |
75
+ | 0.3788 | 3.5759 | 8500 | 0.4996 | 0.8517 | 0.7933 | 0.9194 | 0.8330 | 0.8901 | 0.8517 |
76
+ | 0.3778 | 3.7863 | 9000 | 0.5016 | 0.8606 | 0.8237 | 0.9008 | 0.8477 | 0.8967 | 0.8631 |
77
+ | 0.3923 | 3.9966 | 9500 | 0.5175 | 0.8579 | 0.8356 | 0.8815 | 0.8477 | 0.8895 | 0.8575 |
78
+ | 0.3628 | 4.2070 | 10000 | 0.5557 | 0.8616 | 0.8427 | 0.8815 | 0.8523 | 0.8935 | 0.8706 |
79
+ | 0.4124 | 4.4173 | 10500 | 0.5216 | 0.8621 | 0.8252 | 0.9024 | 0.8494 | 0.8721 | 0.8318 |
80
+ | 0.388 | 4.6277 | 11000 | 0.6025 | 0.8584 | 0.8127 | 0.9097 | 0.8435 | 0.8572 | 0.8122 |
81
+ | 0.4513 | 4.8380 | 11500 | 0.5943 | 0.8500 | 0.8524 | 0.8476 | 0.8439 | 0.9012 | 0.8886 |
82
+ | 0.4206 | 5.0484 | 12000 | 0.5724 | 0.8610 | 0.8414 | 0.8815 | 0.8515 | 0.9016 | 0.8855 |
83
+ | 0.3882 | 5.2587 | 12500 | 0.5748 | 0.8616 | 0.8524 | 0.8710 | 0.8540 | 0.8901 | 0.8724 |
84
+ | 0.3756 | 5.4691 | 13000 | 0.5839 | 0.8635 | 0.8477 | 0.8798 | 0.8549 | 0.8756 | 0.8325 |
85
+ | 0.4158 | 5.6794 | 13500 | 0.5782 | 0.8593 | 0.8169 | 0.9065 | 0.8452 | 0.9048 | 0.8848 |
86
+ | 0.3859 | 5.8898 | 14000 | 0.5989 | 0.8530 | 0.8496 | 0.8565 | 0.8460 | 0.8947 | 0.8717 |
87
+ | 0.336 | 6.1001 | 14500 | 0.6641 | 0.8542 | 0.7996 | 0.9169 | 0.8368 | 0.8697 | 0.8287 |
88
+ | 0.3724 | 6.3105 | 15000 | 0.6330 | 0.8599 | 0.8205 | 0.9032 | 0.8464 | 0.8776 | 0.8500 |
89
+ | 0.3809 | 6.5208 | 15500 | 0.7191 | 0.8258 | 0.8725 | 0.7839 | 0.8275 | 0.8676 | 0.8589 |
90
 
91
 
92
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5aec86e4d218056c40a13f136c49773021b2aa34fac563b046ecaa3dd00d891d
3
  size 442503040
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:607f04e5bb97796515a8b546add6399afecc35ca54b53868ca74deefb95466bc
3
  size 442503040