Tina-Yi/R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO
Tags: Question Answering, PEFT, Safetensors, reasoning
Dataset: knoveleng/open-rs
Languages: English, Chinese
arXiv: 2504.15777
License: apache-2.0
1 contributor
History: 14 commits
Latest commit: "clean up" by upup-ashton-wang (a15b172, verified), 4 months ago
Name             Size      Scan  Last commit                             Updated
checkpoint-100   -         -     clean up                                4 months ago
checkpoint-150   -         -     clean up                                4 months ago
checkpoint-200   -         -     clean up                                4 months ago
checkpoint-250   -         -     clean up                                4 months ago
checkpoint-300   -         -     clean up                                4 months ago
checkpoint-350   -         -     clean up                                4 months ago
checkpoint-400   -         -     clean up                                4 months ago
checkpoint-450   -         -     clean up                                4 months ago
checkpoint-50    -         -     clean up                                4 months ago
checkpoint-500   -         -     clean up                                4 months ago
checkpoint-550   -         -     clean up                                4 months ago
checkpoint-600   -         -     clean up                                4 months ago
checkpoint-650   -         -     add post-trained ckpts from 50 to 850   4 months ago
checkpoint-700   -         -     add post-trained ckpts from 50 to 850   4 months ago
checkpoint-750   -         -     add post-trained ckpts from 50 to 850   4 months ago
checkpoint-800   -         -     add post-trained ckpts from 50 to 850   4 months ago
checkpoint-850   -         -     add post-trained ckpts from 50 to 850   4 months ago
.gitattributes   1.52 kB   Safe  initial commit                          4 months ago
README.md        31 Bytes  Safe  initial commit                          4 months ago