MoGHenry committed on
Commit 9249d5a · verified
1 Parent(s): 52bcfe9

End of training

Files changed (3)
  1. README.md +17 -11
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.9586
- - Accuracy: 0.7951
+ - Loss: 0.5396
+ - Accuracy: 0.8076

  ## Model description

@@ -38,23 +38,29 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 5e-05
- - train_batch_size: 16
- - eval_batch_size: 16
+ - learning_rate: 1e-05
+ - train_batch_size: 64
+ - eval_batch_size: 64
  - seed: 42
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
- - num_epochs: 5
+ - lr_scheduler_warmup_steps: 500
+ - num_epochs: 10

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
- | No log | 1.0 | 381 | 0.4382 | 0.8207 |
- | 0.3996 | 2.0 | 762 | 0.5340 | 0.8122 |
- | 0.2537 | 3.0 | 1143 | 0.6641 | 0.8148 |
- | 0.1347 | 4.0 | 1524 | 0.8495 | 0.7866 |
- | 0.1347 | 5.0 | 1905 | 0.9586 | 0.7951 |
+ | No log | 1.0 | 96 | 0.6598 | 0.7439 |
+ | No log | 2.0 | 192 | 0.4624 | 0.8011 |
+ | No log | 3.0 | 288 | 0.4350 | 0.8148 |
+ | No log | 4.0 | 384 | 0.4326 | 0.8188 |
+ | No log | 5.0 | 480 | 0.4331 | 0.8247 |
+ | 0.4631 | 6.0 | 576 | 0.4566 | 0.8227 |
+ | 0.4631 | 7.0 | 672 | 0.4711 | 0.8194 |
+ | 0.4631 | 8.0 | 768 | 0.5045 | 0.8102 |
+ | 0.4631 | 9.0 | 864 | 0.5400 | 0.8050 |
+ | 0.4631 | 10.0 | 960 | 0.5396 | 0.8076 |


  ### Framework versions
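
Reviewer note on the README.md change: the headline metrics in the new model card (Loss 0.5396, Accuracy 0.8076) match the epoch 10 row, not the best row of the new training results table. The sketch below is not part of the commit. It copies the ten added rows into a hypothetical `RESULTS` list and picks the best epoch by each metric, using only the standard library.

```python
# Per-epoch evaluation metrics copied from the "+" rows of the
# training results table in this commit.
# Each tuple is (epoch, step, validation_loss, accuracy).
# RESULTS and the helper names below are illustrative, not from the repo.
RESULTS = [
    (1, 96, 0.6598, 0.7439),
    (2, 192, 0.4624, 0.8011),
    (3, 288, 0.4350, 0.8148),
    (4, 384, 0.4326, 0.8188),
    (5, 480, 0.4331, 0.8247),
    (6, 576, 0.4566, 0.8227),
    (7, 672, 0.4711, 0.8194),
    (8, 768, 0.5045, 0.8102),
    (9, 864, 0.5400, 0.8050),
    (10, 960, 0.5396, 0.8076),
]


def best_by_loss(results):
    """Return the row with the lowest validation loss."""
    return min(results, key=lambda row: row[2])


def best_by_accuracy(results):
    """Return the row with the highest accuracy."""
    return max(results, key=lambda row: row[3])


def final_row(results):
    """Return the row for the last epoch, which the model card reports."""
    return max(results, key=lambda row: row[0])
```

On these rows, validation loss is lowest at epoch 4 (0.4326) and accuracy is highest at epoch 5 (0.8247); validation loss rises from epoch 6 onward. The commit does not say which checkpoint was saved to `model.safetensors`. If the run used the Transformers `Trainer`, the `load_best_model_at_end` option in `TrainingArguments` is one way to keep the best checkpoint instead of the last one.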
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8f0b6f9a7a7e72a55172e7f097066d42d652b0810e90080fd5278adc6cd777e4
+ oid sha256:60a2186a8f88709c12280b1a47617555b1b9fc2c7db8e24867a701001d492cde
  size 267832560
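
Reviewer note on the pointer change: `model.safetensors` is stored as a Git LFS pointer, so only the `oid` changes here while `size` stays at 267832560 bytes. In a Git LFS pointer, the `oid sha256:` value is the SHA-256 digest of the real file contents and `size` is its length in bytes. The sketch below is not part of the commit. It shows one way to check a downloaded file against a pointer; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the size and digest in `pointer`."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    file_path = Path(path)
    # Cheap check first: a size mismatch means the digest cannot match.
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with file_path.open("rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# The new pointer contents from this commit.
NEW_POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:60a2186a8f88709c12280b1a47617555b1b9fc2c7db8e24867a701001d492cde
size 267832560
"""
```

For example, `matches_pointer("model.safetensors", parse_lfs_pointer(NEW_POINTER_TEXT))` should return `True` for a complete download of this revision, assuming the local file sits at that path.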
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b69bc4f4f54be1460cef4e774978c4bb5585e3a9d56e34e4bebad5df62437c36
+ oid sha256:5d975a74f6bc6bf5d75dca0999400ec9291eaa22f09a4f600f66c7b8ea5372c3
  size 5368