parthh01 committed
Commit 5943a9f · verified · 1 Parent(s): 7f285da

Add model card

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -19,7 +19,7 @@ This model has been trained using Group Relative Policy Optimization (GRPO) to p
  - **Model Type**: PEFT (merged)
  - **Training Method**: GRPO (Group Relative Policy Optimization)
  - **Task**: Chess move generation with evaluation reasoning
- - **Source Path**: ./grpo_output/skill_3-final
+ - **Source Path**: ./grpo_output/skill_6-final