Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kekema19
/
Qwen-2.5-7B-Simple-RL-0222
like
0
Text Generation
Transformers
Safetensors
DigitalLearningGmbH/MATH-lighteval
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen-2.5-7B-Simple-RL-0222
Commit History
End of training
f7578e8
verified
kekema19
commited on
Feb 25
Model save
64edc27
verified
kekema19
commited on
Feb 25
Training in progress, step 201
c25f28a
verified
kekema19
commited on
Feb 25
End of training
3d978c6
verified
kekema19
commited on
Feb 24
Model save
02c15fb
verified
kekema19
commited on
Feb 24
Training in progress, step 200
3a49644
verified
kekema19
commited on
Feb 24
End of training
2f3dfb6
verified
kekema19
commited on
Feb 23
Model save
1322aa8
verified
kekema19
commited on
Feb 23
Training in progress, step 150
f7eef8d
verified
kekema19
commited on
Feb 23
End of training
323e61d
verified
kekema19
commited on
Feb 23
Model save
b1967f9
verified
kekema19
commited on
Feb 23
Training in progress, step 100
c50893d
verified
kekema19
commited on
Feb 23
End of training
b07e2cd
verified
kekema19
commited on
Feb 23
Model save
17c7f01
verified
kekema19
commited on
Feb 23
Training in progress, step 50
229a417
verified
kekema19
commited on
Feb 23
Model save
010b215
verified
kekema19
commited on
Feb 23
End of training
d9e7b2d
verified
kekema19
commited on
Feb 23
Model save
1911fc0
verified
kekema19
commited on
Feb 23
Model save
730ce95
verified
kekema19
commited on
Feb 23
Model save
a3ad1cc
verified
kekema19
commited on
Feb 22
Model save
077aac8
verified
kekema19
commited on
Feb 22
initial commit
93bd8f0
verified
kekema19
commited on
Feb 22