Arihant Tripathi committed on
Commit 7427cd3 · verified · 1 Parent(s): cf22eb2

qwen_new_mage_all_domains_balanced_1.5

README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.0885
- - Accuracy: 0.9739
+ - Loss: 0.1085
+ - Accuracy: 0.9685

  ## Model description

@@ -50,17 +50,17 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:------:|:----:|:---------------:|:--------:|
- | 0.1884 | 0.0860 | 500 | 0.3187 | 0.8822 |
- | 0.1606 | 0.1720 | 1000 | 0.1563 | 0.9433 |
- | 0.1284 | 0.2580 | 1500 | 0.1633 | 0.9460 |
- | 0.1318 | 0.3441 | 2000 | 0.1304 | 0.9586 |
- | 0.1028 | 0.4301 | 2500 | 0.1195 | 0.9640 |
- | 0.1005 | 0.5161 | 3000 | 0.1192 | 0.9604 |
- | 0.0824 | 0.6021 | 3500 | 0.1233 | 0.9595 |
- | 0.0692 | 0.6881 | 4000 | 0.1101 | 0.9667 |
- | 0.0714 | 0.7741 | 4500 | 0.0916 | 0.9721 |
- | 0.0778 | 0.8601 | 5000 | 0.0834 | 0.9748 |
- | 0.0836 | 0.9462 | 5500 | 0.0885 | 0.9739 |
+ | 0.2036 | 0.0860 | 500 | 0.3559 | 0.8714 |
+ | 0.1583 | 0.1720 | 1000 | 0.1607 | 0.9424 |
+ | 0.1279 | 0.2580 | 1500 | 0.1832 | 0.9308 |
+ | 0.1337 | 0.3441 | 2000 | 0.1520 | 0.9514 |
+ | 0.1144 | 0.4301 | 2500 | 0.1332 | 0.9559 |
+ | 0.1061 | 0.5161 | 3000 | 0.1273 | 0.9586 |
+ | 0.0865 | 0.6021 | 3500 | 0.1591 | 0.9514 |
+ | 0.0739 | 0.6881 | 4000 | 0.1265 | 0.9631 |
+ | 0.0716 | 0.7741 | 4500 | 0.1107 | 0.9640 |
+ | 0.0773 | 0.8601 | 5000 | 0.1058 | 0.9640 |
+ | 0.0823 | 0.9462 | 5500 | 0.1085 | 0.9685 |


  ### Framework versions
evaluation_results.json CHANGED
@@ -1,8 +1,8 @@
  {
- "eval_loss": 0.10750104486942291,
- "eval_accuracy": 0.9622302158273381,
- "eval_runtime": 36.0253,
- "eval_samples_per_second": 30.867,
- "eval_steps_per_second": 0.972,
- "epoch": 0.6881128505074833
+ "eval_loss": 0.0833660215139389,
+ "eval_accuracy": 0.9748201438848921,
+ "eval_runtime": 35.98,
+ "eval_samples_per_second": 30.906,
+ "eval_steps_per_second": 0.973,
+ "epoch": 1.0
  }
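
As a sanity check on the evaluation_results.json values, the following Python sketch backs out the implied evaluation-set size from `eval_runtime` times `eval_samples_per_second`, and the implied number of correct predictions from `eval_accuracy`. The function names are hypothetical, and reading the old `epoch` value as the step-4000 checkpoint from the README table is an assumption, not something the commit states.

```python
def implied_eval_counts(runtime_s, samples_per_s, accuracy):
    """Back out (num_samples, num_correct) from reported eval metrics."""
    num_samples = round(runtime_s * samples_per_s)
    num_correct = round(accuracy * num_samples)
    return num_samples, num_correct


def implied_steps_per_epoch(step, epoch_fraction):
    """Estimate optimizer steps per epoch from one (step, epoch) pair."""
    return round(step / epoch_fraction)


# Values copied from the old (-) and new (+) sides of the diff.
old_counts = implied_eval_counts(36.0253, 30.867, 0.9622302158273381)
new_counts = implied_eval_counts(35.98, 30.906, 0.9748201438848921)

# Assumption: the old "epoch" value was logged at step 4000.
steps_per_epoch = implied_steps_per_epoch(4000, 0.6881128505074833)

print(old_counts, new_counts, steps_per_epoch)
```

Under these assumptions, both sides of the diff are consistent with an evaluation set of 1,112 examples (1,070 correct before, 1,084 after) and roughly 5,813 optimizer steps per epoch.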
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:4a50fc5caeb75d42e1a8e2726a3b81541a748f591c453bd4232ce236fb273c79
+ oid sha256:acfd94e15ed966d359f1f508f58b2b7df3aea9e274eaf84811adff3ce00a1f5d
  size 4955308912
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:e43e6a955d00de860f0daf1939254c9b28ba7a15fff72ca724faa14effb8a293
+ oid sha256:bda5155eba2df2b43001250844715454e76b605beff0a27ef439f996f2bd36b0
  size 1147395408
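
The safetensors entries above are Git LFS pointer files: `oid sha256:` is the SHA-256 of the real file's contents and `size` is its length in bytes. A minimal sketch of checking a downloaded shard against its pointer might look like the following; the helper name `matches_lfs_pointer` and the local file path in the usage comment are hypothetical.

```python
import hashlib
import os


def matches_lfs_pointer(path, expected_oid, expected_size, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and SHA-256 from an LFS pointer."""
    # Cheap check first: a size mismatch means the hash cannot match either.
    if os.path.getsize(path) != expected_size:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in chunks so multi-gigabyte shards are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_oid


# Usage (assumes the shard was downloaded to the current directory):
# matches_lfs_pointer(
#     "model-00002-of-00002.safetensors",
#     "bda5155eba2df2b43001250844715454e76b605beff0a27ef439f996f2bd36b0",
#     1147395408,
# )
```

Since the sizes are unchanged in this commit while the oids differ, only the hash comparison distinguishes the old shards from the new ones.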