GS_bert4
This model is a fine-tuned version of biblo0507/GS_bert3 on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 0.1367
- F1: 0.5947
- Precision: 0.6222
- Recall: 0.5741
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 5e-05
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 500
- num_epochs: 350
Training results
Training Loss | Epoch | Step | Validation Loss | F1 | Precision | Recall |
---|---|---|---|---|---|---|
0.6959 | 1.0 | 45 | 0.6620 | 0.0278 | 0.0278 | 0.0278 |
0.5392 | 2.0 | 90 | 0.5095 | 0.0452 | 0.0463 | 0.0444 |
0.4324 | 3.0 | 135 | 0.3960 | 0.0429 | 0.0444 | 0.0417 |
0.3249 | 4.0 | 180 | 0.3024 | 0.0405 | 0.0426 | 0.0389 |
0.2602 | 5.0 | 225 | 0.2354 | 0.0405 | 0.0426 | 0.0389 |
0.2102 | 6.0 | 270 | 0.1962 | 0.0476 | 0.05 | 0.0458 |
0.1892 | 7.0 | 315 | 0.1765 | 0.0508 | 0.0537 | 0.0486 |
0.1762 | 8.0 | 360 | 0.1673 | 0.0632 | 0.0667 | 0.0606 |
0.169 | 9.0 | 405 | 0.1630 | 0.0659 | 0.0704 | 0.0625 |
0.1668 | 10.0 | 450 | 0.1604 | 0.1116 | 0.1185 | 0.1065 |
0.1643 | 11.0 | 495 | 0.1575 | 0.2183 | 0.2278 | 0.2111 |
0.1608 | 12.0 | 540 | 0.1550 | 0.2534 | 0.2648 | 0.2449 |
0.156 | 13.0 | 585 | 0.1518 | 0.3138 | 0.3278 | 0.3032 |
0.1512 | 14.0 | 630 | 0.1487 | 0.3434 | 0.3593 | 0.3315 |
0.1488 | 15.0 | 675 | 0.1457 | 0.3558 | 0.3741 | 0.3421 |
0.1441 | 16.0 | 720 | 0.1430 | 0.3820 | 0.4019 | 0.3671 |
0.1386 | 17.0 | 765 | 0.1400 | 0.3862 | 0.4074 | 0.3704 |
0.1359 | 18.0 | 810 | 0.1372 | 0.4071 | 0.4278 | 0.3917 |
0.1318 | 19.0 | 855 | 0.1349 | 0.4206 | 0.4426 | 0.4042 |
0.1269 | 20.0 | 900 | 0.1320 | 0.4241 | 0.4463 | 0.4074 |
0.1263 | 21.0 | 945 | 0.1297 | 0.4497 | 0.4722 | 0.4329 |
0.123 | 22.0 | 990 | 0.1281 | 0.4484 | 0.4704 | 0.4319 |
0.1196 | 23.0 | 1035 | 0.1261 | 0.4762 | 0.5 | 0.4583 |
0.1158 | 24.0 | 1080 | 0.1234 | 0.4733 | 0.4981 | 0.4546 |
0.1113 | 25.0 | 1125 | 0.1218 | 0.4841 | 0.5074 | 0.4667 |
0.1092 | 26.0 | 1170 | 0.1195 | 0.5040 | 0.5278 | 0.4861 |
0.1029 | 27.0 | 1215 | 0.1184 | 0.5026 | 0.5259 | 0.4852 |
0.1016 | 28.0 | 1260 | 0.1166 | 0.5235 | 0.5481 | 0.5051 |
0.1002 | 29.0 | 1305 | 0.1156 | 0.5243 | 0.55 | 0.5051 |
0.0956 | 30.0 | 1350 | 0.1133 | 0.5489 | 0.5741 | 0.5301 |
0.0923 | 31.0 | 1395 | 0.1123 | 0.5394 | 0.5648 | 0.5204 |
0.0907 | 32.0 | 1440 | 0.1109 | 0.5452 | 0.5704 | 0.5264 |
0.0878 | 33.0 | 1485 | 0.1101 | 0.5418 | 0.5667 | 0.5231 |
0.0878 | 34.0 | 1530 | 0.1083 | 0.5434 | 0.5685 | 0.5245 |
0.08 | 35.0 | 1575 | 0.1065 | 0.5611 | 0.5870 | 0.5417 |
0.0784 | 36.0 | 1620 | 0.1059 | 0.5664 | 0.5926 | 0.5468 |
0.0774 | 37.0 | 1665 | 0.1046 | 0.5786 | 0.6056 | 0.5583 |
0.0765 | 38.0 | 1710 | 0.1042 | 0.5524 | 0.5778 | 0.5333 |
0.0727 | 39.0 | 1755 | 0.1034 | 0.5648 | 0.5907 | 0.5454 |
0.0678 | 40.0 | 1800 | 0.1026 | 0.5815 | 0.6074 | 0.5620 |
0.0678 | 41.0 | 1845 | 0.1014 | 0.5680 | 0.5944 | 0.5481 |
0.0658 | 42.0 | 1890 | 0.1007 | 0.5733 | 0.6 | 0.5532 |
0.0631 | 43.0 | 1935 | 0.1003 | 0.5862 | 0.6130 | 0.5662 |
0.0623 | 44.0 | 1980 | 0.1000 | 0.5778 | 0.6037 | 0.5583 |
0.0581 | 45.0 | 2025 | 0.0991 | 0.5839 | 0.6111 | 0.5634 |
0.0578 | 46.0 | 2070 | 0.0997 | 0.5701 | 0.5963 | 0.5505 |
0.0561 | 47.0 | 2115 | 0.0976 | 0.5913 | 0.6185 | 0.5708 |
0.0519 | 48.0 | 2160 | 0.0972 | 0.5889 | 0.6167 | 0.5681 |
0.0529 | 49.0 | 2205 | 0.0974 | 0.5659 | 0.5926 | 0.5458 |
0.0509 | 50.0 | 2250 | 0.0970 | 0.5981 | 0.6259 | 0.5773 |
0.0502 | 51.0 | 2295 | 0.0970 | 0.5862 | 0.6130 | 0.5662 |
0.0494 | 52.0 | 2340 | 0.0954 | 0.5810 | 0.6074 | 0.5611 |
0.0469 | 53.0 | 2385 | 0.0949 | 0.5923 | 0.6204 | 0.5713 |
0.0446 | 54.0 | 2430 | 0.0960 | 0.5759 | 0.6019 | 0.5565 |
0.0436 | 55.0 | 2475 | 0.0954 | 0.5862 | 0.6130 | 0.5662 |
0.043 | 56.0 | 2520 | 0.0948 | 0.5820 | 0.6093 | 0.5616 |
0.0419 | 57.0 | 2565 | 0.0949 | 0.5743 | 0.6 | 0.5551 |
0.0402 | 58.0 | 2610 | 0.0937 | 0.5847 | 0.6111 | 0.5648 |
0.04 | 59.0 | 2655 | 0.0942 | 0.5852 | 0.6130 | 0.5644 |
0.0376 | 60.0 | 2700 | 0.0939 | 0.5899 | 0.6167 | 0.5699 |
0.0363 | 61.0 | 2745 | 0.0928 | 0.5862 | 0.6130 | 0.5662 |
0.0351 | 62.0 | 2790 | 0.0937 | 0.5902 | 0.6167 | 0.5704 |
0.0352 | 63.0 | 2835 | 0.0929 | 0.5889 | 0.6167 | 0.5681 |
0.0341 | 64.0 | 2880 | 0.0934 | 0.6021 | 0.6296 | 0.5815 |
0.0322 | 65.0 | 2925 | 0.0934 | 0.5968 | 0.6241 | 0.5764 |
0.0311 | 66.0 | 2970 | 0.0940 | 0.5749 | 0.6019 | 0.5546 |
0.0318 | 67.0 | 3015 | 0.0926 | 0.5857 | 0.6130 | 0.5653 |
0.0305 | 68.0 | 3060 | 0.0929 | 0.5944 | 0.6222 | 0.5736 |
0.0303 | 69.0 | 3105 | 0.0938 | 0.5868 | 0.6130 | 0.5671 |
0.0279 | 70.0 | 3150 | 0.0938 | 0.5804 | 0.6074 | 0.5602 |
0.0283 | 71.0 | 3195 | 0.0928 | 0.5950 | 0.6222 | 0.5745 |
0.028 | 72.0 | 3240 | 0.0929 | 0.6056 | 0.6333 | 0.5847 |
0.0267 | 73.0 | 3285 | 0.0948 | 0.5825 | 0.6093 | 0.5625 |
0.0261 | 74.0 | 3330 | 0.0940 | 0.5828 | 0.6093 | 0.5630 |
0.0264 | 75.0 | 3375 | 0.0935 | 0.5876 | 0.6148 | 0.5671 |
0.025 | 76.0 | 3420 | 0.0938 | 0.5839 | 0.6111 | 0.5634 |
0.0243 | 77.0 | 3465 | 0.0935 | 0.5857 | 0.6130 | 0.5653 |
0.0237 | 78.0 | 3510 | 0.0930 | 0.5854 | 0.6130 | 0.5648 |
0.0224 | 79.0 | 3555 | 0.0942 | 0.5751 | 0.6019 | 0.5551 |
0.0231 | 80.0 | 3600 | 0.0937 | 0.5937 | 0.6204 | 0.5736 |
0.022 | 81.0 | 3645 | 0.0943 | 0.5825 | 0.6093 | 0.5625 |
0.0216 | 82.0 | 3690 | 0.0940 | 0.5923 | 0.6204 | 0.5713 |
0.0204 | 83.0 | 3735 | 0.0950 | 0.5836 | 0.6111 | 0.5630 |
0.0211 | 84.0 | 3780 | 0.0937 | 0.5960 | 0.6241 | 0.575 |
0.0202 | 85.0 | 3825 | 0.0949 | 0.5894 | 0.6167 | 0.5690 |
0.0196 | 86.0 | 3870 | 0.0950 | 0.5939 | 0.6204 | 0.5741 |
0.0189 | 87.0 | 3915 | 0.0949 | 0.5894 | 0.6167 | 0.5690 |
0.0181 | 88.0 | 3960 | 0.0953 | 0.5944 | 0.6222 | 0.5736 |
0.0177 | 89.0 | 4005 | 0.0940 | 0.5997 | 0.6278 | 0.5787 |
0.0175 | 90.0 | 4050 | 0.0943 | 0.6042 | 0.6315 | 0.5838 |
0.0171 | 91.0 | 4095 | 0.0946 | 0.6040 | 0.6315 | 0.5833 |
0.0174 | 92.0 | 4140 | 0.0947 | 0.5910 | 0.6185 | 0.5704 |
0.0164 | 93.0 | 4185 | 0.0953 | 0.5817 | 0.6093 | 0.5611 |
0.0159 | 94.0 | 4230 | 0.0959 | 0.5862 | 0.6130 | 0.5662 |
0.0158 | 95.0 | 4275 | 0.0956 | 0.6077 | 0.6352 | 0.5870 |
0.0154 | 96.0 | 4320 | 0.0958 | 0.5902 | 0.6167 | 0.5704 |
0.0155 | 97.0 | 4365 | 0.0970 | 0.5873 | 0.6148 | 0.5667 |
0.0147 | 98.0 | 4410 | 0.0955 | 0.6021 | 0.6296 | 0.5815 |
0.0147 | 99.0 | 4455 | 0.0966 | 0.6003 | 0.6278 | 0.5796 |
0.0139 | 100.0 | 4500 | 0.0973 | 0.5884 | 0.6167 | 0.5671 |
0.0142 | 101.0 | 4545 | 0.0983 | 0.5971 | 0.6241 | 0.5769 |
0.0132 | 102.0 | 4590 | 0.0977 | 0.5971 | 0.6241 | 0.5769 |
0.0134 | 103.0 | 4635 | 0.0985 | 0.5894 | 0.6167 | 0.5690 |
0.0132 | 104.0 | 4680 | 0.0971 | 0.5979 | 0.6259 | 0.5769 |
0.013 | 105.0 | 4725 | 0.0976 | 0.5913 | 0.6185 | 0.5708 |
0.0122 | 106.0 | 4770 | 0.0988 | 0.5997 | 0.6278 | 0.5787 |
0.012 | 107.0 | 4815 | 0.0983 | 0.5971 | 0.6241 | 0.5769 |
0.0123 | 108.0 | 4860 | 0.0992 | 0.5997 | 0.6278 | 0.5787 |
0.0121 | 109.0 | 4905 | 0.0990 | 0.5950 | 0.6222 | 0.5745 |
0.0118 | 110.0 | 4950 | 0.1000 | 0.5979 | 0.6259 | 0.5769 |
0.0118 | 111.0 | 4995 | 0.0992 | 0.5907 | 0.6185 | 0.5699 |
0.0115 | 112.0 | 5040 | 0.0989 | 0.6021 | 0.6296 | 0.5815 |
0.0111 | 113.0 | 5085 | 0.0998 | 0.5889 | 0.6167 | 0.5681 |
0.0114 | 114.0 | 5130 | 0.0992 | 0.5979 | 0.6259 | 0.5769 |
0.0104 | 115.0 | 5175 | 0.1000 | 0.5958 | 0.6241 | 0.5745 |
0.0098 | 116.0 | 5220 | 0.1004 | 0.5963 | 0.6241 | 0.5755 |
0.0104 | 117.0 | 5265 | 0.0998 | 0.6066 | 0.6352 | 0.5852 |
0.0099 | 118.0 | 5310 | 0.1012 | 0.5984 | 0.6259 | 0.5778 |
0.0098 | 119.0 | 5355 | 0.1013 | 0.6071 | 0.6352 | 0.5861 |
0.0094 | 120.0 | 5400 | 0.1015 | 0.5854 | 0.6130 | 0.5648 |
0.0095 | 121.0 | 5445 | 0.1018 | 0.6037 | 0.6315 | 0.5829 |
0.0091 | 122.0 | 5490 | 0.1023 | 0.6040 | 0.6315 | 0.5833 |
0.0089 | 123.0 | 5535 | 0.1014 | 0.5910 | 0.6185 | 0.5704 |
0.009 | 124.0 | 5580 | 0.1027 | 0.6034 | 0.6315 | 0.5824 |
0.009 | 125.0 | 5625 | 0.1020 | 0.5944 | 0.6222 | 0.5736 |
0.0086 | 126.0 | 5670 | 0.1027 | 0.5958 | 0.6241 | 0.5745 |
0.0087 | 127.0 | 5715 | 0.1019 | 0.5947 | 0.6222 | 0.5741 |
0.0082 | 128.0 | 5760 | 0.1024 | 0.5987 | 0.6259 | 0.5782 |
0.0082 | 129.0 | 5805 | 0.1043 | 0.5963 | 0.6241 | 0.5755 |
0.008 | 130.0 | 5850 | 0.1036 | 0.5950 | 0.6222 | 0.5745 |
0.0078 | 131.0 | 5895 | 0.1044 | 0.5947 | 0.6222 | 0.5741 |
0.0076 | 132.0 | 5940 | 0.1043 | 0.6016 | 0.6296 | 0.5806 |
0.0079 | 133.0 | 5985 | 0.1045 | 0.5852 | 0.6130 | 0.5644 |
0.0075 | 134.0 | 6030 | 0.1040 | 0.5889 | 0.6167 | 0.5681 |
0.0073 | 135.0 | 6075 | 0.1054 | 0.5825 | 0.6093 | 0.5625 |
0.0071 | 136.0 | 6120 | 0.1058 | 0.5907 | 0.6185 | 0.5699 |
0.0072 | 137.0 | 6165 | 0.1034 | 0.6053 | 0.6333 | 0.5843 |
0.0066 | 138.0 | 6210 | 0.1062 | 0.5963 | 0.6241 | 0.5755 |
0.0066 | 139.0 | 6255 | 0.1065 | 0.5913 | 0.6185 | 0.5708 |
0.0066 | 140.0 | 6300 | 0.1060 | 0.5931 | 0.6204 | 0.5727 |
0.0064 | 141.0 | 6345 | 0.1075 | 0.5929 | 0.6204 | 0.5722 |
0.0063 | 142.0 | 6390 | 0.1075 | 0.5939 | 0.6222 | 0.5727 |
0.0066 | 143.0 | 6435 | 0.1072 | 0.5923 | 0.6204 | 0.5713 |
0.0064 | 144.0 | 6480 | 0.1065 | 0.5942 | 0.6222 | 0.5731 |
0.0064 | 145.0 | 6525 | 0.1075 | 0.5966 | 0.6241 | 0.5759 |
0.0061 | 146.0 | 6570 | 0.1072 | 0.5947 | 0.6222 | 0.5741 |
0.0062 | 147.0 | 6615 | 0.1073 | 0.5910 | 0.6185 | 0.5704 |
0.006 | 148.0 | 6660 | 0.1090 | 0.5765 | 0.6037 | 0.5560 |
0.0059 | 149.0 | 6705 | 0.1071 | 0.6087 | 0.6370 | 0.5875 |
0.0055 | 150.0 | 6750 | 0.1088 | 0.5910 | 0.6185 | 0.5704 |
0.0056 | 151.0 | 6795 | 0.1096 | 0.5979 | 0.6259 | 0.5769 |
0.0055 | 152.0 | 6840 | 0.1088 | 0.5963 | 0.6241 | 0.5755 |
0.0052 | 153.0 | 6885 | 0.1088 | 0.5960 | 0.6241 | 0.575 |
0.0054 | 154.0 | 6930 | 0.1091 | 0.5889 | 0.6167 | 0.5681 |
0.0052 | 155.0 | 6975 | 0.1092 | 0.5929 | 0.6204 | 0.5722 |
0.005 | 156.0 | 7020 | 0.1101 | 0.5963 | 0.6241 | 0.5755 |
0.0049 | 157.0 | 7065 | 0.1099 | 0.5929 | 0.6204 | 0.5722 |
0.0051 | 158.0 | 7110 | 0.1118 | 0.5966 | 0.6241 | 0.5759 |
0.005 | 159.0 | 7155 | 0.1113 | 0.5892 | 0.6167 | 0.5685 |
0.0046 | 160.0 | 7200 | 0.1099 | 0.5934 | 0.6204 | 0.5731 |
0.0047 | 161.0 | 7245 | 0.1110 | 0.5981 | 0.6259 | 0.5773 |
0.0049 | 162.0 | 7290 | 0.1115 | 0.5868 | 0.6148 | 0.5657 |
0.0046 | 163.0 | 7335 | 0.1124 | 0.5907 | 0.6185 | 0.5699 |
0.0047 | 164.0 | 7380 | 0.1115 | 0.5934 | 0.6204 | 0.5731 |
0.0044 | 165.0 | 7425 | 0.1115 | 0.5989 | 0.6278 | 0.5773 |
0.0045 | 166.0 | 7470 | 0.1128 | 0.5905 | 0.6185 | 0.5694 |
0.0044 | 167.0 | 7515 | 0.1125 | 0.5934 | 0.6204 | 0.5731 |
0.0043 | 168.0 | 7560 | 0.1118 | 0.5963 | 0.6241 | 0.5755 |
0.0043 | 169.0 | 7605 | 0.1124 | 0.5944 | 0.6222 | 0.5736 |
0.004 | 170.0 | 7650 | 0.1132 | 0.5947 | 0.6222 | 0.5741 |
0.004 | 171.0 | 7695 | 0.1132 | 0.5931 | 0.6204 | 0.5727 |
0.004 | 172.0 | 7740 | 0.1132 | 0.5995 | 0.6278 | 0.5782 |
0.0039 | 173.0 | 7785 | 0.1127 | 0.6056 | 0.6333 | 0.5847 |
0.0039 | 174.0 | 7830 | 0.1137 | 0.5939 | 0.6222 | 0.5727 |
0.004 | 175.0 | 7875 | 0.1137 | 0.6056 | 0.6333 | 0.5847 |
0.0038 | 176.0 | 7920 | 0.1124 | 0.5907 | 0.6185 | 0.5699 |
0.0038 | 177.0 | 7965 | 0.1148 | 0.5989 | 0.6278 | 0.5773 |
0.0038 | 178.0 | 8010 | 0.1155 | 0.5995 | 0.6278 | 0.5782 |
0.0036 | 179.0 | 8055 | 0.1148 | 0.5923 | 0.6204 | 0.5713 |
0.0037 | 180.0 | 8100 | 0.1143 | 0.5992 | 0.6278 | 0.5778 |
0.0038 | 181.0 | 8145 | 0.1152 | 0.5958 | 0.6241 | 0.5745 |
0.0036 | 182.0 | 8190 | 0.1149 | 0.5926 | 0.6204 | 0.5718 |
0.0035 | 183.0 | 8235 | 0.1145 | 0.5942 | 0.6222 | 0.5731 |
0.0063 | 184.0 | 8280 | 0.1173 | 0.5799 | 0.6074 | 0.5593 |
0.0074 | 185.0 | 8325 | 0.1177 | 0.6011 | 0.6278 | 0.5810 |
0.0078 | 186.0 | 8370 | 0.1138 | 0.5952 | 0.6222 | 0.575 |
0.0056 | 187.0 | 8415 | 0.1150 | 0.5963 | 0.6241 | 0.5755 |
0.0038 | 188.0 | 8460 | 0.1153 | 0.5921 | 0.6185 | 0.5722 |
0.0036 | 189.0 | 8505 | 0.1160 | 0.5947 | 0.6222 | 0.5741 |
0.0034 | 190.0 | 8550 | 0.1164 | 0.6003 | 0.6278 | 0.5796 |
0.0032 | 191.0 | 8595 | 0.1158 | 0.5934 | 0.6204 | 0.5731 |
0.0032 | 192.0 | 8640 | 0.1174 | 0.5995 | 0.6278 | 0.5782 |
0.0031 | 193.0 | 8685 | 0.1177 | 0.5974 | 0.6259 | 0.5759 |
0.003 | 194.0 | 8730 | 0.1168 | 0.6029 | 0.6315 | 0.5815 |
0.0029 | 195.0 | 8775 | 0.1165 | 0.5979 | 0.6259 | 0.5769 |
0.0029 | 196.0 | 8820 | 0.1164 | 0.5981 | 0.6259 | 0.5773 |
0.0029 | 197.0 | 8865 | 0.1181 | 0.5926 | 0.6204 | 0.5718 |
0.0029 | 198.0 | 8910 | 0.1185 | 0.5907 | 0.6185 | 0.5699 |
0.0028 | 199.0 | 8955 | 0.1180 | 0.6032 | 0.6315 | 0.5819 |
0.0027 | 200.0 | 9000 | 0.1173 | 0.6013 | 0.6296 | 0.5801 |
0.0027 | 201.0 | 9045 | 0.1184 | 0.5979 | 0.6259 | 0.5769 |
0.0028 | 202.0 | 9090 | 0.1186 | 0.5942 | 0.6222 | 0.5731 |
0.0027 | 203.0 | 9135 | 0.1180 | 0.6016 | 0.6296 | 0.5806 |
0.0027 | 204.0 | 9180 | 0.1192 | 0.6013 | 0.6296 | 0.5801 |
0.0026 | 205.0 | 9225 | 0.1188 | 0.5939 | 0.6222 | 0.5727 |
0.0026 | 206.0 | 9270 | 0.1197 | 0.5997 | 0.6278 | 0.5787 |
0.0025 | 207.0 | 9315 | 0.1193 | 0.5929 | 0.6204 | 0.5722 |
0.0025 | 208.0 | 9360 | 0.1199 | 0.5963 | 0.6241 | 0.5755 |
0.0025 | 209.0 | 9405 | 0.1205 | 0.5960 | 0.6241 | 0.575 |
0.0024 | 210.0 | 9450 | 0.1208 | 0.5929 | 0.6204 | 0.5722 |
0.0023 | 211.0 | 9495 | 0.1213 | 0.5944 | 0.6222 | 0.5736 |
0.0024 | 212.0 | 9540 | 0.1214 | 0.5926 | 0.6204 | 0.5718 |
0.0024 | 213.0 | 9585 | 0.1215 | 0.6 | 0.6278 | 0.5792 |
0.0024 | 214.0 | 9630 | 0.1216 | 0.5963 | 0.6241 | 0.5755 |
0.0023 | 215.0 | 9675 | 0.1222 | 0.5913 | 0.6185 | 0.5708 |
0.0023 | 216.0 | 9720 | 0.1211 | 0.5963 | 0.6241 | 0.5755 |
0.0023 | 217.0 | 9765 | 0.1207 | 0.5981 | 0.6259 | 0.5773 |
0.0022 | 218.0 | 9810 | 0.1220 | 0.5976 | 0.6259 | 0.5764 |
0.0021 | 219.0 | 9855 | 0.1222 | 0.5979 | 0.6259 | 0.5769 |
0.0022 | 220.0 | 9900 | 0.1224 | 0.5997 | 0.6278 | 0.5787 |
0.0022 | 221.0 | 9945 | 0.1227 | 0.5966 | 0.6241 | 0.5759 |
0.002 | 222.0 | 9990 | 0.1228 | 0.5960 | 0.6241 | 0.575 |
0.002 | 223.0 | 10035 | 0.1238 | 0.5995 | 0.6278 | 0.5782 |
0.0021 | 224.0 | 10080 | 0.1226 | 0.6050 | 0.6333 | 0.5838 |
0.0021 | 225.0 | 10125 | 0.1234 | 0.6013 | 0.6296 | 0.5801 |
0.002 | 226.0 | 10170 | 0.1232 | 0.5995 | 0.6278 | 0.5782 |
0.002 | 227.0 | 10215 | 0.1234 | 0.5963 | 0.6241 | 0.5755 |
0.0021 | 228.0 | 10260 | 0.1240 | 0.6016 | 0.6296 | 0.5806 |
0.002 | 229.0 | 10305 | 0.1229 | 0.6050 | 0.6333 | 0.5838 |
0.0019 | 230.0 | 10350 | 0.1240 | 0.5997 | 0.6278 | 0.5787 |
0.0018 | 231.0 | 10395 | 0.1239 | 0.5929 | 0.6204 | 0.5722 |
0.0018 | 232.0 | 10440 | 0.1243 | 0.5944 | 0.6222 | 0.5736 |
0.0019 | 233.0 | 10485 | 0.1245 | 0.5947 | 0.6222 | 0.5741 |
0.0019 | 234.0 | 10530 | 0.1250 | 0.5997 | 0.6278 | 0.5787 |
0.0018 | 235.0 | 10575 | 0.1249 | 0.5979 | 0.6259 | 0.5769 |
0.0018 | 236.0 | 10620 | 0.1247 | 0.5997 | 0.6278 | 0.5787 |
0.0019 | 237.0 | 10665 | 0.1259 | 0.6029 | 0.6315 | 0.5815 |
0.0018 | 238.0 | 10710 | 0.1256 | 0.5981 | 0.6259 | 0.5773 |
0.0018 | 239.0 | 10755 | 0.1254 | 0.5947 | 0.6222 | 0.5741 |
0.0019 | 240.0 | 10800 | 0.1258 | 0.5963 | 0.6241 | 0.5755 |
0.0017 | 241.0 | 10845 | 0.1251 | 0.5979 | 0.6259 | 0.5769 |
0.0017 | 242.0 | 10890 | 0.1261 | 0.5984 | 0.6259 | 0.5778 |
0.0017 | 243.0 | 10935 | 0.1257 | 0.6029 | 0.6315 | 0.5815 |
0.0018 | 244.0 | 10980 | 0.1263 | 0.5929 | 0.6204 | 0.5722 |
0.0017 | 245.0 | 11025 | 0.1264 | 0.6 | 0.6278 | 0.5792 |
0.0017 | 246.0 | 11070 | 0.1273 | 0.5934 | 0.6204 | 0.5731 |
0.0017 | 247.0 | 11115 | 0.1261 | 0.6013 | 0.6296 | 0.5801 |
0.0016 | 248.0 | 11160 | 0.1268 | 0.6016 | 0.6296 | 0.5806 |
0.0016 | 249.0 | 11205 | 0.1265 | 0.6013 | 0.6296 | 0.5801 |
0.0016 | 250.0 | 11250 | 0.1277 | 0.6013 | 0.6296 | 0.5801 |
0.0016 | 251.0 | 11295 | 0.1272 | 0.6016 | 0.6296 | 0.5806 |
0.0016 | 252.0 | 11340 | 0.1279 | 0.5963 | 0.6241 | 0.5755 |
0.0016 | 253.0 | 11385 | 0.1264 | 0.5995 | 0.6278 | 0.5782 |
0.0014 | 254.0 | 11430 | 0.1265 | 0.6013 | 0.6296 | 0.5801 |
0.0015 | 255.0 | 11475 | 0.1276 | 0.6013 | 0.6296 | 0.5801 |
0.0015 | 256.0 | 11520 | 0.1276 | 0.6034 | 0.6315 | 0.5824 |
0.0016 | 257.0 | 11565 | 0.1284 | 0.5997 | 0.6278 | 0.5787 |
0.0016 | 258.0 | 11610 | 0.1284 | 0.5976 | 0.6259 | 0.5764 |
0.0016 | 259.0 | 11655 | 0.1284 | 0.5963 | 0.6241 | 0.5755 |
0.0015 | 260.0 | 11700 | 0.1287 | 0.5979 | 0.6259 | 0.5769 |
0.0014 | 261.0 | 11745 | 0.1299 | 0.5942 | 0.6222 | 0.5731 |
0.0014 | 262.0 | 11790 | 0.1284 | 0.6016 | 0.6296 | 0.5806 |
0.0015 | 263.0 | 11835 | 0.1284 | 0.5942 | 0.6222 | 0.5731 |
0.0014 | 264.0 | 11880 | 0.1288 | 0.5966 | 0.6241 | 0.5759 |
0.0014 | 265.0 | 11925 | 0.1291 | 0.6032 | 0.6315 | 0.5819 |
0.0014 | 266.0 | 11970 | 0.1292 | 0.6019 | 0.6296 | 0.5810 |
0.0014 | 267.0 | 12015 | 0.1284 | 0.6053 | 0.6333 | 0.5843 |
0.0013 | 268.0 | 12060 | 0.1295 | 0.5979 | 0.6259 | 0.5769 |
0.0015 | 269.0 | 12105 | 0.1293 | 0.6085 | 0.6370 | 0.5870 |
0.0014 | 270.0 | 12150 | 0.1290 | 0.6034 | 0.6315 | 0.5824 |
0.0013 | 271.0 | 12195 | 0.1303 | 0.5960 | 0.6241 | 0.575 |
0.0013 | 272.0 | 12240 | 0.1307 | 0.6013 | 0.6296 | 0.5801 |
0.0014 | 273.0 | 12285 | 0.1307 | 0.6032 | 0.6315 | 0.5819 |
0.0014 | 274.0 | 12330 | 0.1310 | 0.5966 | 0.6241 | 0.5759 |
0.0013 | 275.0 | 12375 | 0.1311 | 0.5997 | 0.6278 | 0.5787 |
0.0013 | 276.0 | 12420 | 0.1306 | 0.6034 | 0.6315 | 0.5824 |
0.0013 | 277.0 | 12465 | 0.1319 | 0.6021 | 0.6296 | 0.5815 |
0.0013 | 278.0 | 12510 | 0.1317 | 0.5997 | 0.6278 | 0.5787 |
0.0012 | 279.0 | 12555 | 0.1309 | 0.6016 | 0.6296 | 0.5806 |
0.0013 | 280.0 | 12600 | 0.1313 | 0.5981 | 0.6259 | 0.5773 |
0.0013 | 281.0 | 12645 | 0.1309 | 0.5958 | 0.6241 | 0.5745 |
0.0012 | 282.0 | 12690 | 0.1317 | 0.6013 | 0.6296 | 0.5801 |
0.0013 | 283.0 | 12735 | 0.1322 | 0.6011 | 0.6296 | 0.5796 |
0.0012 | 284.0 | 12780 | 0.1323 | 0.5944 | 0.6222 | 0.5736 |
0.0013 | 285.0 | 12825 | 0.1329 | 0.5929 | 0.6204 | 0.5722 |
0.0012 | 286.0 | 12870 | 0.1326 | 0.5981 | 0.6259 | 0.5773 |
0.0012 | 287.0 | 12915 | 0.1327 | 0.6013 | 0.6296 | 0.5801 |
0.0012 | 288.0 | 12960 | 0.1329 | 0.6013 | 0.6296 | 0.5801 |
0.0012 | 289.0 | 13005 | 0.1321 | 0.5963 | 0.6241 | 0.5755 |
0.0011 | 290.0 | 13050 | 0.1325 | 0.5963 | 0.6241 | 0.5755 |
0.0012 | 291.0 | 13095 | 0.1326 | 0.5944 | 0.6222 | 0.5736 |
0.0011 | 292.0 | 13140 | 0.1328 | 0.5979 | 0.6259 | 0.5769 |
0.0012 | 293.0 | 13185 | 0.1333 | 0.5981 | 0.6259 | 0.5773 |
0.0011 | 294.0 | 13230 | 0.1334 | 0.5981 | 0.6259 | 0.5773 |
0.0011 | 295.0 | 13275 | 0.1342 | 0.5907 | 0.6185 | 0.5699 |
0.0011 | 296.0 | 13320 | 0.1338 | 0.6 | 0.6278 | 0.5792 |
0.0011 | 297.0 | 13365 | 0.1333 | 0.6 | 0.6278 | 0.5792 |
0.0012 | 298.0 | 13410 | 0.1339 | 0.5997 | 0.6278 | 0.5787 |
0.0011 | 299.0 | 13455 | 0.1337 | 0.6034 | 0.6315 | 0.5824 |
0.0011 | 300.0 | 13500 | 0.1336 | 0.5944 | 0.6222 | 0.5736 |
0.0011 | 301.0 | 13545 | 0.1339 | 0.6016 | 0.6296 | 0.5806 |
0.0011 | 302.0 | 13590 | 0.1342 | 0.5963 | 0.6241 | 0.5755 |
0.0011 | 303.0 | 13635 | 0.1343 | 0.5997 | 0.6278 | 0.5787 |
0.0011 | 304.0 | 13680 | 0.1344 | 0.5981 | 0.6259 | 0.5773 |
0.001 | 305.0 | 13725 | 0.1348 | 0.6016 | 0.6296 | 0.5806 |
0.0011 | 306.0 | 13770 | 0.1347 | 0.6 | 0.6278 | 0.5792 |
0.0011 | 307.0 | 13815 | 0.1343 | 0.5979 | 0.6259 | 0.5769 |
0.0011 | 308.0 | 13860 | 0.1347 | 0.6034 | 0.6315 | 0.5824 |
0.001 | 309.0 | 13905 | 0.1347 | 0.5997 | 0.6278 | 0.5787 |
0.001 | 310.0 | 13950 | 0.1342 | 0.5997 | 0.6278 | 0.5787 |
0.001 | 311.0 | 13995 | 0.1349 | 0.6016 | 0.6296 | 0.5806 |
0.001 | 312.0 | 14040 | 0.1351 | 0.5960 | 0.6241 | 0.575 |
0.001 | 313.0 | 14085 | 0.1351 | 0.5942 | 0.6222 | 0.5731 |
0.001 | 314.0 | 14130 | 0.1354 | 0.5958 | 0.6241 | 0.5745 |
0.001 | 315.0 | 14175 | 0.1350 | 0.5963 | 0.6241 | 0.5755 |
0.0009 | 316.0 | 14220 | 0.1349 | 0.5960 | 0.6241 | 0.575 |
0.001 | 317.0 | 14265 | 0.1354 | 0.5907 | 0.6185 | 0.5699 |
0.001 | 318.0 | 14310 | 0.1352 | 0.5907 | 0.6185 | 0.5699 |
0.001 | 319.0 | 14355 | 0.1351 | 0.5942 | 0.6222 | 0.5731 |
0.001 | 320.0 | 14400 | 0.1351 | 0.5926 | 0.6204 | 0.5718 |
0.0009 | 321.0 | 14445 | 0.1356 | 0.5942 | 0.6222 | 0.5731 |
0.0011 | 322.0 | 14490 | 0.1352 | 0.5910 | 0.6185 | 0.5704 |
0.001 | 323.0 | 14535 | 0.1355 | 0.5907 | 0.6185 | 0.5699 |
0.001 | 324.0 | 14580 | 0.1359 | 0.5942 | 0.6222 | 0.5731 |
0.001 | 325.0 | 14625 | 0.1356 | 0.5960 | 0.6241 | 0.575 |
0.001 | 326.0 | 14670 | 0.1361 | 0.5960 | 0.6241 | 0.575 |
0.001 | 327.0 | 14715 | 0.1359 | 0.5947 | 0.6222 | 0.5741 |
0.001 | 328.0 | 14760 | 0.1365 | 0.5947 | 0.6222 | 0.5741 |
0.001 | 329.0 | 14805 | 0.1361 | 0.5910 | 0.6185 | 0.5704 |
0.0009 | 330.0 | 14850 | 0.1369 | 0.5892 | 0.6167 | 0.5685 |
0.001 | 331.0 | 14895 | 0.1366 | 0.5873 | 0.6148 | 0.5667 |
0.001 | 332.0 | 14940 | 0.1365 | 0.5931 | 0.6204 | 0.5727 |
0.001 | 333.0 | 14985 | 0.1363 | 0.5966 | 0.6241 | 0.5759 |
0.001 | 334.0 | 15030 | 0.1358 | 0.5944 | 0.6222 | 0.5736 |
0.0009 | 335.0 | 15075 | 0.1358 | 0.5944 | 0.6222 | 0.5736 |
0.0009 | 336.0 | 15120 | 0.1361 | 0.5944 | 0.6222 | 0.5736 |
0.001 | 337.0 | 15165 | 0.1358 | 0.5963 | 0.6241 | 0.5755 |
0.0009 | 338.0 | 15210 | 0.1362 | 0.5926 | 0.6204 | 0.5718 |
0.001 | 339.0 | 15255 | 0.1364 | 0.5944 | 0.6222 | 0.5736 |
0.001 | 340.0 | 15300 | 0.1365 | 0.5963 | 0.6241 | 0.5755 |
0.0009 | 341.0 | 15345 | 0.1367 | 0.5966 | 0.6241 | 0.5759 |
0.001 | 342.0 | 15390 | 0.1366 | 0.5929 | 0.6204 | 0.5722 |
0.0009 | 343.0 | 15435 | 0.1367 | 0.5929 | 0.6204 | 0.5722 |
0.001 | 344.0 | 15480 | 0.1367 | 0.5947 | 0.6222 | 0.5741 |
0.0009 | 345.0 | 15525 | 0.1368 | 0.5929 | 0.6204 | 0.5722 |
0.0009 | 346.0 | 15570 | 0.1368 | 0.5947 | 0.6222 | 0.5741 |
0.001 | 347.0 | 15615 | 0.1367 | 0.5947 | 0.6222 | 0.5741 |
0.0009 | 348.0 | 15660 | 0.1367 | 0.5947 | 0.6222 | 0.5741 |
0.0009 | 349.0 | 15705 | 0.1367 | 0.5947 | 0.6222 | 0.5741 |
0.0009 | 350.0 | 15750 | 0.1367 | 0.5947 | 0.6222 | 0.5741 |
Framework versions
- Transformers 4.51.0.dev0
- Pytorch 2.5.1+cu121
- Datasets 3.4.1
- Tokenizers 0.21.0
- Downloads last month
- 2
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support