Hoang Pham commited on
Commit
4d8c29b
·
verified ·
1 Parent(s): 8a26922

End of training

Browse files
README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the recipe_nlg dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.0517
22
 
23
  ## Model description
24
 
@@ -38,8 +38,8 @@ More information needed
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 2e-05
41
- - train_batch_size: 32
42
- - eval_batch_size: 32
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
@@ -50,8 +50,8 @@ The following hyperparameters were used during training:
50
 
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:-----:|:----:|:---------------:|
53
- | No log | 1.0 | 56 | 0.1018 |
54
- | No log | 2.0 | 112 | 0.0517 |
55
 
56
 
57
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the recipe_nlg dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 2.3036
22
 
23
  ## Model description
24
 
 
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 2e-05
41
+ - train_batch_size: 16
42
+ - eval_batch_size: 16
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
 
50
 
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:-----:|:----:|:---------------:|
53
+ | 2.5638 | 1.0 | 112 | 2.3422 |
54
+ | 2.3798 | 2.0 | 224 | 2.3036 |
55
 
56
 
57
  ### Framework versions
runs/Dec25_10-07-35_5d2b802b7e2b/events.out.tfevents.1735121257.5d2b802b7e2b.20350.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:68e863c58671b82f9a276b725fe248aa05299b67a28c18e9da82828cad4e8869
3
- size 6020
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1f646b575be4d21fbe949785f38f3ff7ad7624dc81d97fea5ed54cb6cb8ab131
3
+ size 6645
runs/Dec25_10-07-35_5d2b802b7e2b/events.out.tfevents.1735121754.5d2b802b7e2b.20350.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:445278599d19325e24137fd2aebc45aaea6df9b3a8be9f5ed05b922f7bf58b29
3
+ size 359