Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
xiwenc1
/
OpenRS-GRPO-DPPv1
like
0
Text Generation
Transformers
Safetensors
knoveleng/open-rs
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
OpenRS-GRPO-DPPv1
Commit History
End of training
8877b63
verified
xiwenc1
commited on
Apr 23
Model save
89ed448
verified
xiwenc1
commited on
Apr 23
Training in progress, step 500
1e67e1b
verified
xiwenc1
commited on
Apr 23
Training in progress, step 450
aa148d1
verified
xiwenc1
commited on
Apr 23
Training in progress, step 400
feea8e1
verified
xiwenc1
commited on
Apr 23
Training in progress, step 350
0fd9586
verified
xiwenc1
commited on
Apr 22
Training in progress, step 300
7fb6b78
verified
xiwenc1
commited on
Apr 22
Training in progress, step 250
16fe15d
verified
xiwenc1
commited on
Apr 22
Training in progress, step 200
1535eaf
verified
xiwenc1
commited on
Apr 22
Training in progress, step 150
2d0d2a7
verified
xiwenc1
commited on
Apr 22
Training in progress, step 100
5cd6d49
verified
xiwenc1
commited on
Apr 22
Training in progress, step 50
9b1c8f6
verified
xiwenc1
commited on
Apr 22
initial commit
215d03c
verified
xiwenc1
commited on
Apr 22