Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Tina-Yi
/
R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO
like
0
Follow
Tina
54
Question Answering
PEFT
Safetensors
knoveleng/open-rs
English
Chinese
reasoning
arxiv:
2504.15777
License:
apache-2.0
Model card
Files
Files and versions
Community
Use this model
main
R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO
/
checkpoint-250
Commit History
clean up
b47a08c
verified
upup-ashton-wang
commited on
Apr 22
add post-trained ckpts from 50 to 850
d24b266
upup-ashton-wang
commited on
Apr 8