Arihant Tripathi commited on
Commit
cf22eb2
·
verified ·
1 Parent(s): 729ab8a

qwen_new_mage_all_domains_balanced_1.5

Browse files
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.1176
22
- - Accuracy: 0.9676
23
 
24
  ## Model description
25
 
@@ -50,14 +50,17 @@ The following hyperparameters were used during training:
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
52
  |:-------------:|:------:|:----:|:---------------:|:--------:|
53
- | 0.1748 | 0.0860 | 500 | 0.4614 | 0.8624 |
54
- | 0.1734 | 0.1720 | 1000 | 0.1477 | 0.9424 |
55
- | 0.1277 | 0.2580 | 1500 | 0.1875 | 0.9290 |
56
- | 0.1305 | 0.3441 | 2000 | 0.1707 | 0.9496 |
57
- | 0.1082 | 0.4301 | 2500 | 0.1075 | 0.9622 |
58
- | 0.1043 | 0.5161 | 3000 | 0.1235 | 0.9568 |
59
- | 0.083 | 0.6021 | 3500 | 0.1094 | 0.9712 |
60
- | 0.0729 | 0.6881 | 4000 | 0.1176 | 0.9676 |
 
 
 
61
 
62
 
63
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.0885
22
+ - Accuracy: 0.9739
23
 
24
  ## Model description
25
 
 
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
52
  |:-------------:|:------:|:----:|:---------------:|:--------:|
53
+ | 0.1884 | 0.0860 | 500 | 0.3187 | 0.8822 |
54
+ | 0.1606 | 0.1720 | 1000 | 0.1563 | 0.9433 |
55
+ | 0.1284 | 0.2580 | 1500 | 0.1633 | 0.9460 |
56
+ | 0.1318 | 0.3441 | 2000 | 0.1304 | 0.9586 |
57
+ | 0.1028 | 0.4301 | 2500 | 0.1195 | 0.9640 |
58
+ | 0.1005 | 0.5161 | 3000 | 0.1192 | 0.9604 |
59
+ | 0.0824 | 0.6021 | 3500 | 0.1233 | 0.9595 |
60
+ | 0.0692 | 0.6881 | 4000 | 0.1101 | 0.9667 |
61
+ | 0.0714 | 0.7741 | 4500 | 0.0916 | 0.9721 |
62
+ | 0.0778 | 0.8601 | 5000 | 0.0834 | 0.9748 |
63
+ | 0.0836 | 0.9462 | 5500 | 0.0885 | 0.9739 |
64
 
65
 
66
  ### Framework versions
evaluation_results.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
- "eval_loss": 0.11335065215826035,
3
  "eval_accuracy": 0.9622302158273381,
4
- "eval_runtime": 28.3039,
5
- "eval_samples_per_second": 39.288,
6
- "eval_steps_per_second": 1.237,
7
- "epoch": 0.7741269568209186
8
  }
 
1
  {
2
+ "eval_loss": 0.10750104486942291,
3
  "eval_accuracy": 0.9622302158273381,
4
+ "eval_runtime": 36.0253,
5
+ "eval_samples_per_second": 30.867,
6
+ "eval_steps_per_second": 0.972,
7
+ "epoch": 0.6881128505074833
8
  }
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cb24a87d1903f499d4e15cec75a447b1d140a567ef105a5b5d8b3b8e9b12feed
3
  size 4955308912
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4a50fc5caeb75d42e1a8e2726a3b81541a748f591c453bd4232ce236fb273c79
3
  size 4955308912
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ed6f09355cbf3b3bf4f87da909efd739ad56ea9ba9b8481ab520afaee91588ca
3
  size 1147395408
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e43e6a955d00de860f0daf1939254c9b28ba7a15fff72ca724faa14effb8a293
3
  size 1147395408