Tina-Yi/R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO

Question Answering

R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO / checkpoint-250

Commit History

clean up

b47a08c
verified

upup-ashton-wang committed on Apr 22

add post-trained ckpts from 50 to 850

d24b266

upup-ashton-wang committed on Apr 8