Update README.md
Browse files
README.md
CHANGED
|
@@ -21,7 +21,7 @@ Light-R1-7B-DS also performed well on GPQA *without* any specific training.
|
|
| 21 |
|
| 22 |
Originated from DeepSeek-R1-Distill-Qwen-7B, Light-R1-7B-DS is further trained with only [3K SFT data](https://huggingface.co/datasets/qihoo360/Light-R1-SFTData) as we've open-sourced, demonstrating the strong applicability of the released data.
|
| 23 |
|
| 24 |
-
We are excited to release this model along with the [technical report](https://github.com/Qihoo360/Light-R1/Light-R1.pdf).
|
| 25 |
|
| 26 |
## Usage
|
| 27 |
Same as DeepSeek-R1-Distill-Qwen-7B.
|
|
|
|
| 21 |
|
| 22 |
Originated from DeepSeek-R1-Distill-Qwen-7B, Light-R1-7B-DS is further trained with only [3K SFT data](https://huggingface.co/datasets/qihoo360/Light-R1-SFTData) as we've open-sourced, demonstrating the strong applicability of the released data.
|
| 23 |
|
| 24 |
+
We are excited to release this model along with the [technical report](https://github.com/Qihoo360/Light-R1/blob/main/Light-R1.pdf).
|
| 25 |
|
| 26 |
## Usage
|
| 27 |
Same as DeepSeek-R1-Distill-Qwen-7B.
|