Model save
Browse files
README.md
CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
18 |
|
19 |
This model is a fine-tuned version of [google/codegemma-7b](https://huggingface.co/google/codegemma-7b) on the None dataset.
|
20 |
It achieves the following results on the evaluation set:
|
21 |
-
- Loss: 0.
|
22 |
|
23 |
## Model description
|
24 |
|
@@ -47,7 +47,7 @@ The following hyperparameters were used during training:
|
|
47 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
48 |
- lr_scheduler_type: cosine
|
49 |
- lr_scheduler_warmup_ratio: 0.03
|
50 |
-
- num_epochs:
|
51 |
- mixed_precision_training: Native AMP
|
52 |
|
53 |
### Training results
|
@@ -144,10 +144,47 @@ The following hyperparameters were used during training:
|
|
144 |
| 0.0766 | 4.6672 | 4400 | 0.0765 |
|
145 |
| 0.0671 | 4.7202 | 4450 | 0.0764 |
|
146 |
| 0.0651 | 4.7733 | 4500 | 0.0762 |
|
147 |
-
| 0.
|
148 |
-
| 0.
|
149 |
-
| 0.
|
150 |
-
| 0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
151 |
|
152 |
|
153 |
### Framework versions
|
|
|
18 |
|
19 |
This model is a fine-tuned version of [google/codegemma-7b](https://huggingface.co/google/codegemma-7b) on the None dataset.
|
20 |
It achieves the following results on the evaluation set:
|
21 |
+
- Loss: 0.0737
|
22 |
|
23 |
## Model description
|
24 |
|
|
|
47 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
48 |
- lr_scheduler_type: cosine
|
49 |
- lr_scheduler_warmup_ratio: 0.03
|
50 |
+
- num_epochs: 7
|
51 |
- mixed_precision_training: Native AMP
|
52 |
|
53 |
### Training results
|
|
|
144 |
| 0.0766 | 4.6672 | 4400 | 0.0765 |
|
145 |
| 0.0671 | 4.7202 | 4450 | 0.0764 |
|
146 |
| 0.0651 | 4.7733 | 4500 | 0.0762 |
|
147 |
+
| 0.0733 | 4.8295 | 4550 | 0.0750 |
|
148 |
+
| 0.0802 | 4.8825 | 4600 | 0.0749 |
|
149 |
+
| 0.0864 | 4.9356 | 4650 | 0.0748 |
|
150 |
+
| 0.0762 | 4.9886 | 4700 | 0.0747 |
|
151 |
+
| 0.0921 | 5.0416 | 4750 | 0.0747 |
|
152 |
+
| 0.0769 | 5.0947 | 4800 | 0.0747 |
|
153 |
+
| 0.0785 | 5.1477 | 4850 | 0.0746 |
|
154 |
+
| 0.0772 | 5.2007 | 4900 | 0.0745 |
|
155 |
+
| 0.0783 | 5.2538 | 4950 | 0.0745 |
|
156 |
+
| 0.0741 | 5.3068 | 5000 | 0.0745 |
|
157 |
+
| 0.08 | 5.3599 | 5050 | 0.0744 |
|
158 |
+
| 0.0813 | 5.4129 | 5100 | 0.0744 |
|
159 |
+
| 0.0764 | 5.4659 | 5150 | 0.0744 |
|
160 |
+
| 0.0752 | 5.5190 | 5200 | 0.0743 |
|
161 |
+
| 0.0778 | 5.5720 | 5250 | 0.0743 |
|
162 |
+
| 0.0813 | 5.6250 | 5300 | 0.0743 |
|
163 |
+
| 0.0701 | 5.6781 | 5350 | 0.0743 |
|
164 |
+
| 0.071 | 5.7311 | 5400 | 0.0742 |
|
165 |
+
| 0.0764 | 5.7841 | 5450 | 0.0742 |
|
166 |
+
| 0.0846 | 5.8372 | 5500 | 0.0742 |
|
167 |
+
| 0.0738 | 5.8902 | 5550 | 0.0742 |
|
168 |
+
| 0.0748 | 5.9433 | 5600 | 0.0741 |
|
169 |
+
| 0.0781 | 5.9963 | 5650 | 0.0741 |
|
170 |
+
| 0.0739 | 6.0493 | 5700 | 0.0741 |
|
171 |
+
| 0.069 | 6.1024 | 5750 | 0.0741 |
|
172 |
+
| 0.08 | 6.1554 | 5800 | 0.0741 |
|
173 |
+
| 0.0737 | 6.2084 | 5850 | 0.0740 |
|
174 |
+
| 0.075 | 6.2615 | 5900 | 0.0740 |
|
175 |
+
| 0.0752 | 6.3145 | 5950 | 0.0740 |
|
176 |
+
| 0.0859 | 6.3675 | 6000 | 0.0739 |
|
177 |
+
| 0.0872 | 6.4206 | 6050 | 0.0739 |
|
178 |
+
| 0.0768 | 6.4736 | 6100 | 0.0739 |
|
179 |
+
| 0.0742 | 6.5267 | 6150 | 0.0739 |
|
180 |
+
| 0.0779 | 6.5797 | 6200 | 0.0739 |
|
181 |
+
| 0.072 | 6.6327 | 6250 | 0.0739 |
|
182 |
+
| 0.0717 | 6.6858 | 6300 | 0.0738 |
|
183 |
+
| 0.0735 | 6.7388 | 6350 | 0.0738 |
|
184 |
+
| 0.0787 | 6.7918 | 6400 | 0.0738 |
|
185 |
+
| 0.0792 | 6.8449 | 6450 | 0.0738 |
|
186 |
+
| 0.0743 | 6.8979 | 6500 | 0.0737 |
|
187 |
+
| 0.074 | 6.9509 | 6550 | 0.0737 |
|
188 |
|
189 |
|
190 |
### Framework versions
|
adapter_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4745934024
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8fd3e303f9ed535d6b4de69bf438962a4bf013a0b00ff8a7883a4e0699d74d61
|
3 |
size 4745934024
|
runs/Sep13_17-28-28_m3h110/events.out.tfevents.1757748882.m3h110.591035.0
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ace62e14a66be2003dbf3822246c4a955f38d911974af4ada9fb4de109f08ea2
|
3 |
+
size 60971
|