End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the recipe_nlg dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.4498
 ## Model description
@@ -43,15 +43,23 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 5.7279        | 1.0   | 256  | 4.4997          |
-| 4.9661        | 2.0   | 512  | 4.4498          |
 ### Framework versions

 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the recipe_nlg dataset.
 It achieves the following results on the evaluation set:
+- Loss: 6.0133
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 7.0675        | 1.0   | 256  | 5.7808          |
+| 6.0315        | 2.0   | 512  | 5.7849          |
+| 5.8136        | 3.0   | 768  | 5.8173          |
+| 5.6857        | 4.0   | 1024 | 5.8658          |
+| 5.6426        | 5.0   | 1280 | 5.9132          |
+| 5.5854        | 6.0   | 1536 | 5.9587          |
+| 5.5609        | 7.0   | 1792 | 5.9801          |
+| 5.5196        | 8.0   | 2048 | 5.9917          |
+| 5.4844        | 9.0   | 2304 | 6.0054          |
+| 5.5273        | 10.0  | 2560 | 6.0133          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d7182d9de47dc1d785102c83ecb28958032a625cf2dd2dd4683c0c11a38362dc
 size 327657928

 version https://git-lfs.github.com/spec/v1
+oid sha256:9aff1d3203bb82d5d735eaa429ac170ad6e09d927298855afa278e7e8d194e89
 size 327657928

runs/Dec26_11-23-45_4f87fa13cf4f/events.out.tfevents.1735212228.4f87fa13cf4f.4524.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9206086ef6b5946595131475259c9aa0cd05498a3986b6bffd25adbac7c9763c
-size 9667

 version https://git-lfs.github.com/spec/v1
+oid sha256:78b8df3c28d844f6a2653fc41d38e941ddcf946c9642fa6562cd4a88cceee2c8
+size 10503

runs/Dec26_11-23-45_4f87fa13cf4f/events.out.tfevents.1735212707.4f87fa13cf4f.4524.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:9858792ebcc70a22043255ca330b9c3af9f78b7578fa8838049d592f0f281553
+size 359