Update README.md
Browse files
README.md
CHANGED
@@ -6,7 +6,7 @@ tags:
|
|
6 |
datasets:
|
7 |
- princeton-nlp/gemma2-ultrafeedback-armorm
|
8 |
model-index:
|
9 |
-
- name: gemma-2-9b-it-gmsimpo-beta10-gm0.5-tau20-lr8e-7
|
10 |
results: []
|
11 |
---
|
12 |
|
@@ -14,7 +14,7 @@ model-index:
|
|
14 |
should probably proofread and complete it, then remove this comment. -->
|
15 |
|
16 |
[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](None)
|
17 |
-
# gemma-2-9b-it-gmsimpo-beta10-gm0.5-tau20-lr8e-7
|
18 |
|
19 |
This model is a fine-tuned version of [Sunshine279/gammaPO-gemma-2-9b-it](https://huggingface.co/Sunshine279/gammaPO-gemma-2-9b-it) on the princeton-nlp/gemma2-ultrafeedback-armorm dataset.
|
20 |
It achieves the following results on the evaluation set:
|
|
|
6 |
datasets:
|
7 |
- princeton-nlp/gemma2-ultrafeedback-armorm
|
8 |
model-index:
|
9 |
+
- name: gemma-2-9b-it-gmsimpo-beta10-gm0.5-tau20-lr8e-7
|
10 |
results: []
|
11 |
---
|
12 |
|
|
|
14 |
should probably proofread and complete it, then remove this comment. -->
|
15 |
|
16 |
[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](None)
|
17 |
+
# gemma-2-9b-it-gmsimpo-beta10-gm0.5-tau20-lr8e-7
|
18 |
|
19 |
This model is a fine-tuned version of [Sunshine279/gammaPO-gemma-2-9b-it](https://huggingface.co/Sunshine279/gammaPO-gemma-2-9b-it) on the princeton-nlp/gemma2-ultrafeedback-armorm dataset.
|
20 |
It achieves the following results on the evaluation set:
|