Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
zjc664656505
/
Qwen2-0.5B-GRPO-test
like
0
Transformers
TensorBoard
Safetensors
AI-MO/NuminaMath-TIR
Generated from Trainer
trl
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
Qwen2-0.5B-GRPO-test
Commit History
End of training
cbdeb74
verified
zjc664656505
commited on
Mar 16
Model save
7656599
verified
zjc664656505
commited on
Mar 16
Training in progress, step 113
13574c1
verified
zjc664656505
commited on
Mar 16
Training in progress, step 110
5412c2c
verified
zjc664656505
commited on
Mar 16
Training in progress, step 100
6546f08
verified
zjc664656505
commited on
Mar 16
Training in progress, step 90
3076ae1
verified
zjc664656505
commited on
Mar 16
Training in progress, step 80
8135da7
verified
zjc664656505
commited on
Mar 16
Training in progress, step 70
dd0ede3
verified
zjc664656505
commited on
Mar 16
Training in progress, step 60
9a4c84c
verified
zjc664656505
commited on
Mar 16
Training in progress, step 50
dca83f9
verified
zjc664656505
commited on
Mar 16
Training in progress, step 40
51c86bb
verified
zjc664656505
commited on
Mar 16
Training in progress, step 30
a269e94
verified
zjc664656505
commited on
Mar 16
Training in progress, step 20
31646d8
verified
zjc664656505
commited on
Mar 16
Training in progress, step 10
6b6cfd6
verified
zjc664656505
commited on
Mar 16
initial commit
7ebc5fb
verified
zjc664656505
commited on
Mar 16