qwen_new_mage_all_domains_balanced_1.5

Files changed (4)

README.md CHANGED

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0885
-- Accuracy: 0.9739
+- Loss: 0.1085
+- Accuracy: 0.9685
 ## Model description
@@ -50,17 +50,17 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy |
 |:-------------:|:------:|:----:|:---------------:|:--------:|
-| 0.1884        | 0.0860 | 500  | 0.3187          | 0.8822   |
-| 0.1606        | 0.1720 | 1000 | 0.1563          | 0.9433   |
-| 0.1284        | 0.2580 | 1500 | 0.1633          | 0.9460   |
-| 0.1318        | 0.3441 | 2000 | 0.1304          | 0.9586   |
-| 0.1028        | 0.4301 | 2500 | 0.1195          | 0.9640   |
-| 0.1005        | 0.5161 | 3000 | 0.1192          | 0.9604   |
-| 0.0824        | 0.6021 | 3500 | 0.1233          | 0.9595   |
-| 0.0692        | 0.6881 | 4000 | 0.1101          | 0.9667   |
-| 0.0714        | 0.7741 | 4500 | 0.0916          | 0.9721   |
-| 0.0778        | 0.8601 | 5000 | 0.0834          | 0.9748   |
-| 0.0836        | 0.9462 | 5500 | 0.0885          | 0.9739   |
+| 0.2036        | 0.0860 | 500  | 0.3559          | 0.8714   |
+| 0.1583        | 0.1720 | 1000 | 0.1607          | 0.9424   |
+| 0.1279        | 0.2580 | 1500 | 0.1832          | 0.9308   |
+| 0.1337        | 0.3441 | 2000 | 0.1520          | 0.9514   |
+| 0.1144        | 0.4301 | 2500 | 0.1332          | 0.9559   |
+| 0.1061        | 0.5161 | 3000 | 0.1273          | 0.9586   |
+| 0.0865        | 0.6021 | 3500 | 0.1591          | 0.9514   |
+| 0.0739        | 0.6881 | 4000 | 0.1265          | 0.9631   |
+| 0.0716        | 0.7741 | 4500 | 0.1107          | 0.9640   |
+| 0.0773        | 0.8601 | 5000 | 0.1058          | 0.9640   |
+| 0.0823        | 0.9462 | 5500 | 0.1085          | 0.9685   |
 ### Framework versions

evaluation_results.json CHANGED

@@ -1,8 +1,8 @@
 {
-    "eval_loss": 0.10750104486942291,
-    "eval_accuracy": 0.9622302158273381,
-    "eval_runtime": 36.0253,
-    "eval_samples_per_second": 30.867,
-    "eval_steps_per_second": 0.972,
-    "epoch": 0.6881128505074833
+    "eval_loss": 0.0833660215139389,
+    "eval_accuracy": 0.9748201438848921,
+    "eval_runtime": 35.98,
+    "eval_samples_per_second": 30.906,
+    "eval_steps_per_second": 0.973,
+    "epoch": 1.0
 }
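
The two versions of `evaluation_results.json` can be compared numerically. The sketch below is illustrative only: it assumes `eval_samples_per_second` is the number of evaluation samples divided by `eval_runtime` (rounded when reported), and the helper names `estimate_eval_samples` and `correct_predictions` are hypothetical, not part of this repository.

```python
# Values copied from the old and new evaluation_results.json in this diff.
old_results = {
    "eval_accuracy": 0.9622302158273381,
    "eval_runtime": 36.0253,
    "eval_samples_per_second": 30.867,
}
new_results = {
    "eval_accuracy": 0.9748201438848921,
    "eval_runtime": 35.98,
    "eval_samples_per_second": 30.906,
}


def estimate_eval_samples(results: dict) -> int:
    """Estimate the eval set size, assuming throughput = samples / runtime."""
    return round(results["eval_runtime"] * results["eval_samples_per_second"])


def correct_predictions(results: dict) -> int:
    """Estimate how many eval samples were classified correctly."""
    return round(results["eval_accuracy"] * estimate_eval_samples(results))


if __name__ == "__main__":
    for name, res in (("old", old_results), ("new", new_results)):
        n = estimate_eval_samples(res)
        print(f"{name}: ~{n} samples, ~{correct_predictions(res)} correct")
    gained = correct_predictions(new_results) - correct_predictions(old_results)
    print(f"additional correct predictions: {gained}")
```

Under that assumption both files describe the same evaluation set size, so the accuracy change corresponds to a small number of additional correct predictions.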

model-00001-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4a50fc5caeb75d42e1a8e2726a3b81541a748f591c453bd4232ce236fb273c79
+oid sha256:acfd94e15ed966d359f1f508f58b2b7df3aea9e274eaf84811adff3ce00a1f5d
 size 4955308912

model-00002-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e43e6a955d00de860f0daf1939254c9b28ba7a15fff72ca724faa14effb8a293
+oid sha256:bda5155eba2df2b43001250844715454e76b605beff0a27ef439f996f2bd36b0
 size 1147395408
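
The `.safetensors` entries above are Git LFS pointer files: the `oid` is the SHA-256 digest of the actual weight file and `size` is its length in bytes. A minimal sketch of checking a downloaded shard against such a pointer follows; the function names `parse_lfs_pointer` and `file_matches_pointer` are hypothetical helpers introduced here for illustration, and the local file path is an assumption.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,          # "sha256" in the pointers shown above
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and hash match the pointer."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False
    h = hashlib.new(pointer["algo"])
    with p.open("rb") as f:
        # Hash in chunks so multi-gigabyte shards do not need to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer_text = (
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:bda5155eba2df2b43001250844715454e76b605beff0a27ef439f996f2bd36b0\n"
        "size 1147395408\n"
    )
    pointer = parse_lfs_pointer(pointer_text)
    shard = Path("model-00002-of-00002.safetensors")  # assumed local download
    if shard.exists():
        print(file_matches_pointer(shard, pointer))
```

Checking the size first is a cheap way to reject a truncated download before hashing the whole file.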