Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
LLucass
/
TT_L0.2_H0.28_grpo
like
0
Text Generation
Transformers
Safetensors
knoveleng/open-rs
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
TT_L0.2_H0.28_grpo
Commit History
End of training
8323296
verified
LLucass
commited on
Jun 8
Model save
694e1e6
verified
LLucass
commited on
Jun 8
Training in progress, step 200, checkpoint
5076bdb
verified
LLucass
commited on
Jun 8
Training in progress, step 200
e399fd1
verified
LLucass
commited on
Jun 8
Training in progress, step 150, checkpoint
9bdba33
verified
LLucass
commited on
Jun 8
Training in progress, step 150
75da3a3
verified
LLucass
commited on
Jun 8
Training in progress, step 100, checkpoint
5b2229d
verified
LLucass
commited on
Jun 8
Training in progress, step 100
72716b9
verified
LLucass
commited on
Jun 8
Training in progress, step 50, checkpoint
60c464e
verified
LLucass
commited on
Jun 8
Training in progress, step 50
2f33322
verified
LLucass
commited on
Jun 8
initial commit
3e4761f
verified
LLucass
commited on
Jun 8