Update README.md
Browse files
README.md
CHANGED
@@ -8,6 +8,7 @@ library_name: peft
|
|
8 |
<!-- Provide a quick summary of what the model is/does. -->
|
9 |
This is the model that is used to get the paper results for the MPCxR1 Qwen2.5 3B SFT GRPO model.
|
10 |
This model was evaluated on the 19.04.25. and trained on the 18.04.25 at 13:29:26.
|
11 |
-
The base model used for this training was "nibauman/race_llm_Qwen_3B_sft"
|
|
|
12 |
|
13 |
|
|
|
8 |
<!-- Provide a quick summary of what the model is/does. -->
|
9 |
This is the model that is used to get the paper results for the MPCxR1 Qwen2.5 3B SFT GRPO model.
|
10 |
This model was evaluated on the 19.04.25. and trained on the 18.04.25 at 13:29:26.
|
11 |
+
The base model used for this training was "nibauman/race_llm_Qwen_3B_sft" [here](https://huggingface.co/nibauman/race_llm_Qwen_3B_sft)
|
12 |
+
This is the wandb training: https://wandb.ai/CoRL-heist-2025/mpc_grpo/runs/osa45as5?nw=nwusernibaumaneth
|
13 |
|
14 |
|