Zacktree commited on
Commit
e8ad6a2
·
verified ·
1 Parent(s): 272a207

Model save

Browse files
README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [google/codegemma-7b](https://huggingface.co/google/codegemma-7b) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.0762
22
 
23
  ## Model description
24
 
@@ -47,7 +47,7 @@ The following hyperparameters were used during training:
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: cosine
49
  - lr_scheduler_warmup_ratio: 0.03
50
- - num_epochs: 5
51
  - mixed_precision_training: Native AMP
52
 
53
  ### Training results
@@ -144,10 +144,47 @@ The following hyperparameters were used during training:
144
  | 0.0766 | 4.6672 | 4400 | 0.0765 |
145
  | 0.0671 | 4.7202 | 4450 | 0.0764 |
146
  | 0.0651 | 4.7733 | 4500 | 0.0762 |
147
- | 0.0621 | 4.8263 | 4550 | 0.0762 |
148
- | 0.0638 | 4.8793 | 4600 | 0.0762 |
149
- | 0.0727 | 4.9324 | 4650 | 0.0762 |
150
- | 0.059 | 4.9854 | 4700 | 0.0762 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
151
 
152
 
153
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [google/codegemma-7b](https://huggingface.co/google/codegemma-7b) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.0737
22
 
23
  ## Model description
24
 
 
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: cosine
49
  - lr_scheduler_warmup_ratio: 0.03
50
+ - num_epochs: 7
51
  - mixed_precision_training: Native AMP
52
 
53
  ### Training results
 
144
  | 0.0766 | 4.6672 | 4400 | 0.0765 |
145
  | 0.0671 | 4.7202 | 4450 | 0.0764 |
146
  | 0.0651 | 4.7733 | 4500 | 0.0762 |
147
+ | 0.0733 | 4.8295 | 4550 | 0.0750 |
148
+ | 0.0802 | 4.8825 | 4600 | 0.0749 |
149
+ | 0.0864 | 4.9356 | 4650 | 0.0748 |
150
+ | 0.0762 | 4.9886 | 4700 | 0.0747 |
151
+ | 0.0921 | 5.0416 | 4750 | 0.0747 |
152
+ | 0.0769 | 5.0947 | 4800 | 0.0747 |
153
+ | 0.0785 | 5.1477 | 4850 | 0.0746 |
154
+ | 0.0772 | 5.2007 | 4900 | 0.0745 |
155
+ | 0.0783 | 5.2538 | 4950 | 0.0745 |
156
+ | 0.0741 | 5.3068 | 5000 | 0.0745 |
157
+ | 0.08 | 5.3599 | 5050 | 0.0744 |
158
+ | 0.0813 | 5.4129 | 5100 | 0.0744 |
159
+ | 0.0764 | 5.4659 | 5150 | 0.0744 |
160
+ | 0.0752 | 5.5190 | 5200 | 0.0743 |
161
+ | 0.0778 | 5.5720 | 5250 | 0.0743 |
162
+ | 0.0813 | 5.6250 | 5300 | 0.0743 |
163
+ | 0.0701 | 5.6781 | 5350 | 0.0743 |
164
+ | 0.071 | 5.7311 | 5400 | 0.0742 |
165
+ | 0.0764 | 5.7841 | 5450 | 0.0742 |
166
+ | 0.0846 | 5.8372 | 5500 | 0.0742 |
167
+ | 0.0738 | 5.8902 | 5550 | 0.0742 |
168
+ | 0.0748 | 5.9433 | 5600 | 0.0741 |
169
+ | 0.0781 | 5.9963 | 5650 | 0.0741 |
170
+ | 0.0739 | 6.0493 | 5700 | 0.0741 |
171
+ | 0.069 | 6.1024 | 5750 | 0.0741 |
172
+ | 0.08 | 6.1554 | 5800 | 0.0741 |
173
+ | 0.0737 | 6.2084 | 5850 | 0.0740 |
174
+ | 0.075 | 6.2615 | 5900 | 0.0740 |
175
+ | 0.0752 | 6.3145 | 5950 | 0.0740 |
176
+ | 0.0859 | 6.3675 | 6000 | 0.0739 |
177
+ | 0.0872 | 6.4206 | 6050 | 0.0739 |
178
+ | 0.0768 | 6.4736 | 6100 | 0.0739 |
179
+ | 0.0742 | 6.5267 | 6150 | 0.0739 |
180
+ | 0.0779 | 6.5797 | 6200 | 0.0739 |
181
+ | 0.072 | 6.6327 | 6250 | 0.0739 |
182
+ | 0.0717 | 6.6858 | 6300 | 0.0738 |
183
+ | 0.0735 | 6.7388 | 6350 | 0.0738 |
184
+ | 0.0787 | 6.7918 | 6400 | 0.0738 |
185
+ | 0.0792 | 6.8449 | 6450 | 0.0738 |
186
+ | 0.0743 | 6.8979 | 6500 | 0.0737 |
187
+ | 0.074 | 6.9509 | 6550 | 0.0737 |
188
 
189
 
190
  ### Framework versions
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1c7016ce49bc4fb6b1f699561d1891a56b6833a0bcec08b5d90853b6c3e9846c
3
  size 4745934024
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8fd3e303f9ed535d6b4de69bf438962a4bf013a0b00ff8a7883a4e0699d74d61
3
  size 4745934024
runs/Sep13_17-28-28_m3h110/events.out.tfevents.1757748882.m3h110.591035.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2929704e35ebbe326a6a5e1147e4c72d39c603287ee6a01b35de192fa2410bc8
3
- size 60617
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ace62e14a66be2003dbf3822246c4a955f38d911974af4ada9fb4de109f08ea2
3
+ size 60971