Arihant Tripathi commited on
Commit
729ab8a
·
verified ·
1 Parent(s): bd3d668

qwen_new_mage_all_domains_balanced_1.5

Browse files
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.1147
22
- - Accuracy: 0.9703
23
 
24
  ## Model description
25
 
@@ -50,15 +50,14 @@ The following hyperparameters were used during training:
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
52
  |:-------------:|:------:|:----:|:---------------:|:--------:|
53
- | 0.1945 | 0.0860 | 500 | 0.4187 | 0.8732 |
54
- | 0.171 | 0.1720 | 1000 | 0.1541 | 0.9371 |
55
- | 0.1235 | 0.2580 | 1500 | 0.1469 | 0.9451 |
56
- | 0.1233 | 0.3441 | 2000 | 0.1433 | 0.9532 |
57
- | 0.1109 | 0.4301 | 2500 | 0.1395 | 0.9541 |
58
- | 0.108 | 0.5161 | 3000 | 0.1134 | 0.9622 |
59
- | 0.0864 | 0.6021 | 3500 | 0.1452 | 0.9649 |
60
- | 0.0743 | 0.6881 | 4000 | 0.1336 | 0.9676 |
61
- | 0.0637 | 0.7741 | 4500 | 0.1147 | 0.9703 |
62
 
63
 
64
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.1176
22
+ - Accuracy: 0.9676
23
 
24
  ## Model description
25
 
 
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
52
  |:-------------:|:------:|:----:|:---------------:|:--------:|
53
+ | 0.1748 | 0.0860 | 500 | 0.4614 | 0.8624 |
54
+ | 0.1734 | 0.1720 | 1000 | 0.1477 | 0.9424 |
55
+ | 0.1277 | 0.2580 | 1500 | 0.1875 | 0.9290 |
56
+ | 0.1305 | 0.3441 | 2000 | 0.1707 | 0.9496 |
57
+ | 0.1082 | 0.4301 | 2500 | 0.1075 | 0.9622 |
58
+ | 0.1043 | 0.5161 | 3000 | 0.1235 | 0.9568 |
59
+ | 0.083 | 0.6021 | 3500 | 0.1094 | 0.9712 |
60
+ | 0.0729 | 0.6881 | 4000 | 0.1176 | 0.9676 |
 
61
 
62
 
63
  ### Framework versions
evaluation_results.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
- "eval_loss": 0.24256853759288788,
3
- "eval_accuracy": 0.9019784172661871,
4
- "eval_runtime": 35.5314,
5
- "eval_samples_per_second": 31.296,
6
- "eval_steps_per_second": 0.985,
7
- "epoch": 0.12041974883880957
8
  }
 
1
  {
2
+ "eval_loss": 0.11335065215826035,
3
+ "eval_accuracy": 0.9622302158273381,
4
+ "eval_runtime": 28.3039,
5
+ "eval_samples_per_second": 39.288,
6
+ "eval_steps_per_second": 1.237,
7
+ "epoch": 0.7741269568209186
8
  }
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:35a4e2c95d89066e9c03c121a80c973fd69c2e3ae6e2c593e40709687d1b3ed4
3
  size 4955308912
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cb24a87d1903f499d4e15cec75a447b1d140a567ef105a5b5d8b3b8e9b12feed
3
  size 4955308912
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8dd3e778230e8c8e93f4df9a72a0a124e61c394cd68abbff590dc5b9f2fc7f7b
3
  size 1147395408
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ed6f09355cbf3b3bf4f87da909efd739ad56ea9ba9b8481ab520afaee91588ca
3
  size 1147395408