Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Tina-Yi
/
R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO
like
0
Follow
Tina
54
Question Answering
PEFT
Safetensors
knoveleng/open-rs
English
Chinese
reasoning
arxiv:
2504.15777
License:
apache-2.0
Model card
Files
Files and versions
Community
Use this model
main
R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO
Commit History
Update README.md
b7ddb26
verified
upup-ashton-wang
commited on
Jul 8
Update README.md
67b8d59
verified
upup-ashton-wang
commited on
Apr 22
clean up
8959748
verified
upup-ashton-wang
commited on
Apr 22
clean up
ea4e7c8
verified
upup-ashton-wang
commited on
Apr 22
clean up
ee33b37
verified
upup-ashton-wang
commited on
Apr 22
clean up
21ab6f0
verified
upup-ashton-wang
commited on
Apr 22
clean up
c813631
verified
upup-ashton-wang
commited on
Apr 22
clean up
a15b172
verified
upup-ashton-wang
commited on
Apr 22
clean up
a9487c4
verified
upup-ashton-wang
commited on
Apr 22
clean up
704c2d1
verified
upup-ashton-wang
commited on
Apr 22
clean up
99c9d74
verified
upup-ashton-wang
commited on
Apr 22
clean up
ac2493e
verified
upup-ashton-wang
commited on
Apr 22
clean up
23b1866
verified
upup-ashton-wang
commited on
Apr 22
clean up
c7d835a
verified
upup-ashton-wang
commited on
Apr 22
clean up
7a746c5
verified
upup-ashton-wang
commited on
Apr 22
clean up
b47a08c
verified
upup-ashton-wang
commited on
Apr 22
clean up
c1f6601
verified
upup-ashton-wang
commited on
Apr 22
clean up
3acf889
verified
upup-ashton-wang
commited on
Apr 22
clean up
dbd62df
verified
upup-ashton-wang
commited on
Apr 22
add post-trained ckpts from 50 to 850
d24b266
upup-ashton-wang
commited on
Apr 8
initial commit
e29c7c1
verified
upup-ashton-wang
commited on
Apr 8