Hoang Pham commited on
Commit
0bea568
·
verified ·
1 Parent(s): bcc915c

End of training

Browse files
README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the recipe_nlg dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 6.0133
22
 
23
  ## Model description
24
 
@@ -43,23 +43,15 @@ The following hyperparameters were used during training:
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
- - num_epochs: 10
47
  - mixed_precision_training: Native AMP
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:-----:|:----:|:---------------:|
53
- | 7.0675 | 1.0 | 256 | 5.7808 |
54
- | 6.0315 | 2.0 | 512 | 5.7849 |
55
- | 5.8136 | 3.0 | 768 | 5.8173 |
56
- | 5.6857 | 4.0 | 1024 | 5.8658 |
57
- | 5.6426 | 5.0 | 1280 | 5.9132 |
58
- | 5.5854 | 6.0 | 1536 | 5.9587 |
59
- | 5.5609 | 7.0 | 1792 | 5.9801 |
60
- | 5.5196 | 8.0 | 2048 | 5.9917 |
61
- | 5.4844 | 9.0 | 2304 | 6.0054 |
62
- | 5.5273 | 10.0 | 2560 | 6.0133 |
63
 
64
 
65
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the recipe_nlg dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 5.7513
22
 
23
  ## Model description
24
 
 
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
+ - num_epochs: 2
47
  - mixed_precision_training: Native AMP
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:-----:|:----:|:---------------:|
53
+ | 7.1619 | 1.0 | 256 | 5.7933 |
54
+ | 6.2992 | 2.0 | 512 | 5.7513 |
 
 
 
 
 
 
 
 
55
 
56
 
57
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f412a59fe18fc965d31bd0fe8bd912fab4e9a8189ff09b432a45f7a928a740c0
3
  size 327657928
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f1ff59d047e97e5ae883cd795f443a797e722558e5999b70b0fd0f8db91c5bdc
3
  size 327657928
runs/Dec26_11-57-44_4f87fa13cf4f/events.out.tfevents.1735214266.4f87fa13cf4f.18119.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:772cd55d2c212c53cb196d7c4111c44b9d6b2f946bfa82f29134c28ea358522b
3
- size 5810
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6c426486486968026e433e8b8079948fc6f5eb57015aec2cb067929336faf157
3
+ size 6646
runs/Dec26_11-57-44_4f87fa13cf4f/events.out.tfevents.1735214370.4f87fa13cf4f.18119.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:22858c7f8a1ab7595ac148b103cd98d86681827254616ac399f1a27a7cf9c5ee
3
+ size 359