File size: 152 Bytes
8d710c6 |
1 2 3 4 5 6 7 |
Base Model: ReasoningEval/DeepSeek-R1-Distill-Qwen-7B-Huatuo-SFT-all Training Epochs: 3 Training Objective: RL Training Data: ReasoningEval/Huatuo-RL |
8d710c6 |
1 2 3 4 5 6 7 |
Base Model: ReasoningEval/DeepSeek-R1-Distill-Qwen-7B-Huatuo-SFT-all Training Epochs: 3 Training Objective: RL Training Data: ReasoningEval/Huatuo-RL |