qwen_new_mage_all_domains_balanced_1.5

Files changed (4) hide show

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1176
-- Accuracy: 0.9676
 ## Model description
@@ -50,14 +50,17 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy |
 |:-------------:|:------:|:----:|:---------------:|:--------:|
-| 0.1748        | 0.0860 | 500  | 0.4614          | 0.8624   |
-| 0.1734        | 0.1720 | 1000 | 0.1477          | 0.9424   |
-| 0.1277        | 0.2580 | 1500 | 0.1875          | 0.9290   |
-| 0.1305        | 0.3441 | 2000 | 0.1707          | 0.9496   |
-| 0.1082        | 0.4301 | 2500 | 0.1075          | 0.9622   |
-| 0.1043        | 0.5161 | 3000 | 0.1235          | 0.9568   |
-| 0.083         | 0.6021 | 3500 | 0.1094          | 0.9712   |
-| 0.0729        | 0.6881 | 4000 | 0.1176          | 0.9676   |
 ### Framework versions

 This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0885
+- Accuracy: 0.9739
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy |
 |:-------------:|:------:|:----:|:---------------:|:--------:|
+| 0.1884        | 0.0860 | 500  | 0.3187          | 0.8822   |
+| 0.1606        | 0.1720 | 1000 | 0.1563          | 0.9433   |
+| 0.1284        | 0.2580 | 1500 | 0.1633          | 0.9460   |
+| 0.1318        | 0.3441 | 2000 | 0.1304          | 0.9586   |
+| 0.1028        | 0.4301 | 2500 | 0.1195          | 0.9640   |
+| 0.1005        | 0.5161 | 3000 | 0.1192          | 0.9604   |
+| 0.0824        | 0.6021 | 3500 | 0.1233          | 0.9595   |
+| 0.0692        | 0.6881 | 4000 | 0.1101          | 0.9667   |
+| 0.0714        | 0.7741 | 4500 | 0.0916          | 0.9721   |
+| 0.0778        | 0.8601 | 5000 | 0.0834          | 0.9748   |
+| 0.0836        | 0.9462 | 5500 | 0.0885          | 0.9739   |
 ### Framework versions

evaluation_results.json CHANGED Viewed

@@ -1,8 +1,8 @@
 {
-    "eval_loss": 0.11335065215826035,
     "eval_accuracy": 0.9622302158273381,
-    "eval_runtime": 28.3039,
-    "eval_samples_per_second": 39.288,
-    "eval_steps_per_second": 1.237,
-    "epoch": 0.7741269568209186
 }

 {
+    "eval_loss": 0.10750104486942291,
     "eval_accuracy": 0.9622302158273381,
+    "eval_runtime": 36.0253,
+    "eval_samples_per_second": 30.867,
+    "eval_steps_per_second": 0.972,
+    "epoch": 0.6881128505074833
 }

model-00001-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cb24a87d1903f499d4e15cec75a447b1d140a567ef105a5b5d8b3b8e9b12feed
 size 4955308912

 version https://git-lfs.github.com/spec/v1
+oid sha256:4a50fc5caeb75d42e1a8e2726a3b81541a748f591c453bd4232ce236fb273c79
 size 4955308912

model-00002-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ed6f09355cbf3b3bf4f87da909efd739ad56ea9ba9b8481ab520afaee91588ca
 size 1147395408

 version https://git-lfs.github.com/spec/v1
+oid sha256:e43e6a955d00de860f0daf1939254c9b28ba7a15fff72ca724faa14effb8a293
 size 1147395408