Xavarary committed on
Commit b3d2ab8 · verified · 1 Parent(s): 02fe4e2

End of training

README.md CHANGED
@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.3282
- - Model Preparation Time: 0.0022
+ - Loss: 2.4671
+ - Model Preparation Time: 0.0017

  ## Model description

@@ -37,8 +37,8 @@ More information needed

  The following hyperparameters were used during training:
  - learning_rate: 2e-05
- - train_batch_size: 4
- - eval_batch_size: 4
+ - train_batch_size: 16
+ - eval_batch_size: 16
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
@@ -49,14 +49,14 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Validation Loss | Model Preparation Time |
  |:-------------:|:-----:|:----:|:---------------:|:----------------------:|
- | No log | 1.0 | 1522 | 2.7092 | 0.0022 |
- | 3.0854 | 2.0 | 3044 | 2.3807 | 0.0022 |
- | 3.0854 | 3.0 | 4566 | 2.3339 | 0.0022 |
+ | 2.6493 | 1.0 | 625 | 2.4796 | 0.0017 |
+ | 2.5272 | 2.0 | 1250 | 2.4307 | 0.0017 |
+ | 2.4737 | 3.0 | 1875 | 2.4012 | 0.0017 |


  ### Framework versions

  - Transformers 4.47.1
- - Pytorch 2.5.1+cu124
+ - Pytorch 2.5.0+cu124
  - Datasets 3.2.0
  - Tokenizers 0.21.0
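
The step counts in the two results tables, read together with the batch sizes, put rough bounds on how many training examples each run saw per epoch. The sketch below is only an illustration. It assumes a single device, no gradient accumulation, and a final partial batch that is kept (so steps per epoch = ceil(examples / batch size)); none of this is confirmed by the hunks shown, and the helper name `examples_per_epoch_bounds` is hypothetical.

```python
def examples_per_epoch_bounds(steps_per_epoch: int, batch_size: int) -> tuple[int, int]:
    """Return inclusive (low, high) bounds on the number of training examples.

    Assumes steps_per_epoch == ceil(n_examples / batch_size), i.e. one device,
    no gradient accumulation, and the last partial batch is not dropped.
    """
    high = steps_per_epoch * batch_size            # every batch full
    low = (steps_per_epoch - 1) * batch_size + 1   # last batch holds one example
    return low, high


# Steps per epoch are read off the tables: 1522 before the commit, 625 after.
old_bounds = examples_per_epoch_bounds(steps_per_epoch=1522, batch_size=4)
new_bounds = examples_per_epoch_bounds(steps_per_epoch=625, batch_size=16)

print("old run:", old_bounds)
print("new run:", new_bounds)
```

Under these assumptions the two ranges do not overlap, which would suggest the runs were trained on differently sized datasets, so the two evaluation losses may not be directly comparable.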
runs/Feb09_02-25-40_pop-os/events.out.tfevents.1739064353.pop-os.177455.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:bbbd85ef3ba1132a967aab2f4599d4235c806c22efd85b31f3f362be773b2690
- size 6577
+ oid sha256:77d6d1e6fb36b4e1c9b7a2ab939428eda323504cc03044d31a45eda73e088bb2
+ size 7268
runs/Feb09_02-25-40_pop-os/events.out.tfevents.1739065959.pop-os.177455.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6249235d8602b62c97344f33ef76e76725c32cb7440e297b8ff4743c324e39cd
+ size 425
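
The three-line files in the `runs/` diffs are Git LFS pointers rather than the TensorBoard event files themselves: each records the spec version, the SHA-256 object ID, and the byte size of the real file. A minimal sketch of reading such a pointer follows. The function name `parse_lfs_pointer` is hypothetical, and it handles only the three keys that appear in this commit.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its version, hash algorithm, oid, and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by a single space.
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "oid_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),  # size of the real object, in bytes
    }


# Pointer content copied from the added events file in this commit.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:6249235d8602b62c97344f33ef76e76725c32cb7440e297b8ff4743c324e39cd\n"
    "size 425\n"
)
print(pointer["oid_algo"], pointer["size"])
```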