qwen_new_mage_all_domains_1.5

Browse files

Files changed (5) hide show

README.md +20 -13
evaluation_results.json +8 -0
model-00001-of-00002.safetensors +1 -1
model-00002-of-00002.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2369
-- Accuracy: 0.9083
 ## Model description
@@ -38,25 +38,32 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy |
 |:-------------:|:------:|:----:|:---------------:|:--------:|
-| 0.6318        | 0.0101 | 100  | 0.4632          | 0.8561   |
-| 0.3263        | 0.0201 | 200  | 0.2369          | 0.9029   |
-| 0.313         | 0.0302 | 300  | 0.3302          | 0.8813   |
-| 0.2693        | 0.0403 | 400  | 0.2005          | 0.9083   |
-| 0.2409        | 0.0503 | 500  | 0.2083          | 0.8993   |
-| 0.229         | 0.0604 | 600  | 0.2315          | 0.9011   |
-| 0.2413        | 0.0704 | 700  | 0.2369          | 0.9083   |
 ### Framework versions

 This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2531
+- Accuracy: 0.9460
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-06
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 1
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy |
 |:-------------:|:------:|:----:|:---------------:|:--------:|
+| 0.4234        | 0.0126 | 500  | 0.3614          | 0.8957   |
+| 0.2949        | 0.0252 | 1000 | 0.2974          | 0.9101   |
+| 0.3592        | 0.0377 | 1500 | 0.2913          | 0.9137   |
+| 0.3101        | 0.0503 | 2000 | 0.2877          | 0.9326   |
+| 0.2923        | 0.0629 | 2500 | 0.2246          | 0.9290   |
+| 0.2778        | 0.0755 | 3000 | 0.2472          | 0.9397   |
+| 0.2556        | 0.0881 | 3500 | 0.2163          | 0.9487   |
+| 0.2986        | 0.1006 | 4000 | 0.2156          | 0.9478   |
+| 0.272         | 0.1132 | 4500 | 0.2387          | 0.9388   |
+| 0.2363        | 0.1258 | 5000 | 0.4263          | 0.9326   |
+| 0.221         | 0.1384 | 5500 | 0.2054          | 0.9505   |
+| 0.2478        | 0.1510 | 6000 | 0.2851          | 0.9451   |
+| 0.2451        | 0.1635 | 6500 | 0.2730          | 0.9442   |
+| 0.1915        | 0.1761 | 7000 | 0.2531          | 0.9460   |
 ### Framework versions

evaluation_results.json ADDED Viewed

	@@ -0,0 +1,8 @@

+{
+    "eval_loss": 0.2368740439414978,
+    "eval_accuracy": 0.908273381294964,
+    "eval_runtime": 28.8018,
+    "eval_samples_per_second": 38.609,
+    "eval_steps_per_second": 1.215,
+    "epoch": 0.07044379591425984
+}

model-00001-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:af6790d1cfb8b9acd3fc63ced7f126b447916ca802d836cbdbdacb3117b394de
 size 4955308912

 version https://git-lfs.github.com/spec/v1
+oid sha256:0a1067afc74585ee695b85c60fc974576a8e7d4ee6eba6eeb2636a938d36e9df
 size 4955308912

model-00002-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:33eb9476761acc3b2fe9cb13af864b43bd8c00b2ef10ce65a78e009b8e8a877d
 size 1147395408

 version https://git-lfs.github.com/spec/v1
+oid sha256:0510236097662cfc2e94b028f34b8d516e8599603ea0a96488425c60ea3d9726
 size 1147395408

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2de1ec0279d72eeb08a5ccc5261602d6ce0a7549afbfcb40f8835ec9fd89882e
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:463712a41d01f91b4a9ef4304dd54394bfcad3d7d241af84139b637861a985a0
 size 5304