nibauman commited on
Commit
808c110
·
verified ·
1 Parent(s): 5508584

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -8,6 +8,7 @@ library_name: peft
8
  <!-- Provide a quick summary of what the model is/does. -->
9
  This is the model that is used to get the paper results for the MPCxR1 Qwen2.5 3B SFT GRPO model.
10
  This model was evaluated on the 19.04.25. and trained on the 18.04.25 at 13:29:26.
11
- The base model used for this training was "nibauman/race_llm_Qwen_3B_sft"
 
12
 
13
 
 
8
  <!-- Provide a quick summary of what the model is/does. -->
9
  This is the model that is used to get the paper results for the MPCxR1 Qwen2.5 3B SFT GRPO model.
10
  This model was evaluated on the 19.04.25. and trained on the 18.04.25 at 13:29:26.
11
+ The base model used for this training was "nibauman/race_llm_Qwen_3B_sft" [here](https://huggingface.co/nibauman/race_llm_Qwen_3B_sft)
12
+ This is the wandb training: https://wandb.ai/CoRL-heist-2025/mpc_grpo/runs/osa45as5?nw=nwusernibaumaneth
13
 
14