End of training

Browse files

Files changed (5) hide show

README.md +17 -19
model.safetensors +1 -1
runs/Jun07_11-03-27_0417-111206-hkgnini8-10-45-16-12/events.out.tfevents.1717758208.0417-111206-hkgnini8-10-45-16-12.1976.4 +3 -0
runs/Jun07_11-03-27_0417-111206-hkgnini8-10-45-16-12/events.out.tfevents.1717759234.0417-111206-hkgnini8-10-45-16-12.1976.5 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/deberta-v3-small](https://huggingface.co/microsoft/deberta-v3-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1015
-- Accuracy: 0.9693
-- F1: 0.8766
-- Precision: 0.8803
-- Recall: 0.8729
 ## Model description
@@ -43,30 +43,28 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 3e-05
 - train_batch_size: 5
 - eval_batch_size: 5
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 12
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
-| No log        | 1.0   | 100  | 0.3499          | 0.875    | 0.0    | 0.0       | 0.0    |
-| No log        | 2.0   | 200  | 0.2517          | 0.9068   | 0.4211 | 0.9412    | 0.2712 |
-| No log        | 3.0   | 300  | 0.1835          | 0.9396   | 0.7077 | 0.8961    | 0.5847 |
-| No log        | 4.0   | 400  | 0.1338          | 0.9587   | 0.8219 | 0.8911    | 0.7627 |
-| 0.2507        | 5.0   | 500  | 0.1043          | 0.9640   | 0.8522 | 0.875     | 0.8305 |
-| 0.2507        | 6.0   | 600  | 0.1076          | 0.9629   | 0.8472 | 0.8739    | 0.8220 |
-| 0.2507        | 7.0   | 700  | 0.1061          | 0.9619   | 0.8475 | 0.8475    | 0.8475 |
-| 0.2507        | 8.0   | 800  | 0.1015          | 0.9693   | 0.8766 | 0.8803    | 0.8729 |
-| 0.2507        | 9.0   | 900  | 0.1099          | 0.9650   | 0.8596 | 0.8632    | 0.8559 |
-| 0.0434        | 10.0  | 1000 | 0.1101          | 0.9661   | 0.8632 | 0.8707    | 0.8559 |
-| 0.0434        | 11.0  | 1100 | 0.1054          | 0.9693   | 0.8766 | 0.8803    | 0.8729 |
-| 0.0434        | 12.0  | 1200 | 0.1066          | 0.9682   | 0.8729 | 0.8729    | 0.8729 |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/deberta-v3-small](https://huggingface.co/microsoft/deberta-v3-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1653
+- Accuracy: 0.9454
+- F1: 0.7754
+- Precision: 0.7975
+- Recall: 0.7545
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-05
 - train_batch_size: 5
 - eval_batch_size: 5
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
+| No log        | 1.0   | 90   | 0.3559          | 0.875    | 0.0    | 0.0       | 0.0    |
+| No log        | 2.0   | 180  | 0.2851          | 0.9087   | 0.4602 | 0.8814    | 0.3114 |
+| No log        | 3.0   | 270  | 0.2462          | 0.9049   | 0.4940 | 0.7381    | 0.3713 |
+| No log        | 4.0   | 360  | 0.2183          | 0.9222   | 0.6232 | 0.7890    | 0.5150 |
+| No log        | 5.0   | 450  | 0.1938          | 0.9304   | 0.6869 | 0.7846    | 0.6108 |
+| 0.2617        | 6.0   | 540  | 0.1804          | 0.9349   | 0.7129 | 0.7941    | 0.6467 |
+| 0.2617        | 7.0   | 630  | 0.1752          | 0.9364   | 0.7231 | 0.7929    | 0.6647 |
+| 0.2617        | 8.0   | 720  | 0.1719          | 0.9409   | 0.7539 | 0.7857    | 0.7246 |
+| 0.2617        | 9.0   | 810  | 0.1676          | 0.9424   | 0.7601 | 0.7922    | 0.7305 |
+| 0.2617        | 10.0  | 900  | 0.1653          | 0.9454   | 0.7754 | 0.7975    | 0.7545 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c2aac81c9df133f37908a1d33c3059c34a4b56cd9b202b5737df92fa5acb3dce
 size 567617008

 version https://git-lfs.github.com/spec/v1
+oid sha256:d92e26b2e3e14e1011918cab4d011eb7e9366b177872eb02bc1ad59b2cca6407
 size 567617008

runs/Jun07_11-03-27_0417-111206-hkgnini8-10-45-16-12/events.out.tfevents.1717758208.0417-111206-hkgnini8-10-45-16-12.1976.4 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:61b43938c0b89ca879f81b6333a25bc9a80b1794c02a3938f6206e7b750bc4ea
+size 10605

runs/Jun07_11-03-27_0417-111206-hkgnini8-10-45-16-12/events.out.tfevents.1717759234.0417-111206-hkgnini8-10-45-16-12.1976.5 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a3554fad571bfe02c442e5cb779397eaa717495a8dd27026661189e4b62818b9
+size 512

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8be92743a7c70c72c05bdbed351b751517b7154183e5deae2147d82dba51cac7
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:5f88116e1eac958a6f7ca06f6122c0981d0f808a439e511b07895af9687d4b76
 size 5112