Checkpoints for models trained in https://arxiv.org/abs/2502.04463
-
daman1209arora/alpha_0.4_DeepSeek-R1-Distill-Qwen-7B
Text Generation • 8B • Updated • 21 -
daman1209arora/alpha_0.05_DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • 2B • Updated • 31 -
daman1209arora/alpha_0.05_DeepSeek-R1-Distill-Qwen-7B
Text Generation • 8B • Updated • 8 -
daman1209arora/alpha_0.2_DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • 2B • Updated • 2.79k