ewre324 commited on
Commit
3592b13
·
verified ·
1 Parent(s): ef1d886

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -1
README.md CHANGED
@@ -6,4 +6,10 @@ base_model:
6
  ---
7
 
8
  Used Open R1 (by Huggingface) to SFT my earlier thinker models. Encouraging results.
9
- Checkpoints also present.
 
 
 
 
 
 
 
6
  ---
7
 
8
  Used Open R1 (by Huggingface) to SFT my earlier thinker models. Encouraging results.
9
+ Checkpoints also present.
10
+
11
+ https://github.com/ewre324/open-r1/tree/main
12
+
13
+
14
+ Based on DeepSeek R1 based method to train on specific reasoning dataset to ensure more thinking.
15
+ Still the <think> ... </think> tags are not generated. TODO.