Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
SWY666
/
Qwen-2.5-7B-Simple-RL
like
0
Text Generation
Transformers
Safetensors
DigitalLearningGmbH/MATH-lighteval
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen-2.5-7B-Simple-RL
Commit History
End of training
82a1938
verified
SWY666
commited on
Feb 18
Model save
313a1a5
verified
SWY666
commited on
Feb 18
Model save
24aeeb7
verified
SWY666
commited on
Feb 14
initial commit
33844ff
verified
SWY666
commited on
Feb 12