End of training

Files changed (3)

README.md CHANGED

@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.3282
-- Model Preparation Time: 0.0022
 ## Model description
@@ -37,8 +37,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
@@ -49,14 +49,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Model Preparation Time |
 |:-------------:|:-----:|:----:|:---------------:|:----------------------:|
-| No log        | 1.0   | 1522 | 2.7092          | 0.0022                 |
-| 3.0854        | 2.0   | 3044 | 2.3807          | 0.0022                 |
-| 3.0854        | 3.0   | 4566 | 2.3339          | 0.0022                 |
 ### Framework versions
 - Transformers 4.47.1
-- Pytorch 2.5.1+cu124
 - Datasets 3.2.0
 - Tokenizers 0.21.0

 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.4671
+- Model Preparation Time: 0.0017
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | Model Preparation Time |
 |:-------------:|:-----:|:----:|:---------------:|:----------------------:|
+| 2.6493        | 1.0   | 625  | 2.4796          | 0.0017                 |
+| 2.5272        | 2.0   | 1250 | 2.4307          | 0.0017                 |
+| 2.4737        | 3.0   | 1875 | 2.4012          | 0.0017                 |
 ### Framework versions
 - Transformers 4.47.1
+- Pytorch 2.5.0+cu124
 - Datasets 3.2.0
 - Tokenizers 0.21.0
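
The step counts in the two results tables do not scale with the batch-size change alone, which suggests the training set also changed between runs. A minimal sketch of that arithmetic follows, assuming a single device and no gradient accumulation (neither setting is visible in the hunks shown); the helper name `max_examples_per_epoch` is hypothetical.

```python
# Steps per epoch and train batch sizes as reported in the old and new card.
runs = {
    "before": {"steps_per_epoch": 1522, "batch_size": 4},
    "after": {"steps_per_epoch": 625, "batch_size": 16},
}


def max_examples_per_epoch(steps_per_epoch: int, batch_size: int) -> int:
    """Upper bound on examples seen in one epoch.

    Assumes one device and no gradient accumulation; the last batch
    of an epoch may be partial, so the true count can be slightly lower.
    """
    return steps_per_epoch * batch_size


estimates = {name: max_examples_per_epoch(**cfg) for name, cfg in runs.items()}
for name, upper_bound in estimates.items():
    print(f"{name}: at most {upper_bound} training examples per epoch")
```

Under these assumptions the old run saw at most 6088 examples per epoch and the new run at most 10000, so the two validation losses are probably not a like-for-like comparison of batch size alone.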

runs/Feb09_02-25-40_pop-os/events.out.tfevents.1739064353.pop-os.177455.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bbbd85ef3ba1132a967aab2f4599d4235c806c22efd85b31f3f362be773b2690
-size 6577

 version https://git-lfs.github.com/spec/v1
+oid sha256:77d6d1e6fb36b4e1c9b7a2ab939428eda323504cc03044d31a45eda73e088bb2
+size 7268

runs/Feb09_02-25-40_pop-os/events.out.tfevents.1739065959.pop-os.177455.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:6249235d8602b62c97344f33ef76e76725c32cb7440e297b8ff4743c324e39cd
+size 425
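
The TensorBoard event files above are stored as Git LFS pointers: three `key value` lines giving the spec version, the content hash, and the size in bytes. The following is a rough sketch of reading such a pointer, using the added file's pointer as input; `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file, in bytes
    }


pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:6249235d8602b62c97344f33ef76e76725c32cb7440e297b8ff4743c324e39cd
size 425
"""

info = parse_lfs_pointer(pointer_text)
print(info["hash_algo"], info["size"])
```

Only the pointer is versioned in the repository; the digest identifies the actual event file held in LFS storage.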