wooferclaw/Qwen-2.5-7B-Simple-RL
Text Generation
Transformers
Safetensors
open-r1/OpenR1-Math-220k
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Commit History
Training in progress, step 300
3da62cf
verified
wooferclaw
committed on
Mar 15
Training in progress, step 250
dc4d089
verified
wooferclaw
committed on
Mar 15
Training in progress, step 200
4aa8bd3
verified
wooferclaw
committed on
Mar 14
Training in progress, step 150
3862101
verified
wooferclaw
committed on
Mar 14
Training in progress, step 100
e980ed2
verified
wooferclaw
committed on
Mar 14
Training in progress, step 50
46a012f
verified
wooferclaw
committed on
Mar 14
End of training
cf62c03
verified
wooferclaw
committed on
Mar 11
Model save
37e4d7c
verified
wooferclaw
committed on
Mar 11
Training in progress, step 488
9269090
verified
wooferclaw
committed on
Mar 11
Training in progress, step 400
63240ed
verified
wooferclaw
committed on
Mar 10
Training in progress, step 300
761ce17
verified
wooferclaw
committed on
Mar 10
Training in progress, step 200
7dec388
verified
wooferclaw
committed on
Mar 9
Training in progress, step 100
35808d9
verified
wooferclaw
committed on
Mar 8
Model save
3535b70
verified
wooferclaw
committed on
Mar 5
initial commit
a94d5ba
verified
wooferclaw
committed on
Feb 27