Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kekema19
/
Qwen-2.5-7B-Simple-RL
like
0
Text Generation
Transformers
Safetensors
DigitalLearningGmbH/MATH-lighteval
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen-2.5-7B-Simple-RL
Commit History
End of training
3e1f07c
verified
kekema19
commited on
Feb 22
Model save
f169e5b
verified
kekema19
commited on
Feb 22
Training in progress, step 170
1f31d36
verified
kekema19
commited on
Feb 22
End of training
6f438d4
verified
kekema19
commited on
Feb 21
Model save
b6c3a88
verified
kekema19
commited on
Feb 21
Training in progress, step 130
fccc22f
verified
kekema19
commited on
Feb 21
End of training
24bb0c0
verified
kekema19
commited on
Feb 21
Model save
4c3d1db
verified
kekema19
commited on
Feb 21
Training in progress, step 100
c52746d
verified
kekema19
commited on
Feb 21
Model save
b9ea1e5
verified
kekema19
commited on
Feb 20
Training in progress, step 80
b0ef82c
verified
kekema19
commited on
Feb 20
Model save
9eac6d7
verified
kekema19
commited on
Feb 19
Model save
d2f930c
verified
kekema19
commited on
Feb 17
Model save
ec72473
verified
kekema19
commited on
Feb 17
initial commit
160ef6b
verified
kekema19
commited on
Feb 17