ReasoningEval/DeepSeek-R1-Distill-Qwen-7B-Huatuo-SFT-all-RL


shengliu66 committed 8 days ago

Commit 8d710c6 (verified) · 1 parent: 5276920

Update README.md

Files changed (1)

README.md +7 -3


@@ -1,3 +1,7 @@
----
-license: apache-2.0
----

+Base Model: ReasoningEval/DeepSeek-R1-Distill-Qwen-7B-Huatuo-SFT-all
+Training Epochs: 3
+Training Objective: RL
+Training Data: ReasoningEval/Huatuo-RL