Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Dongwei
/
Qwen-2.5-7B_Base_Math_smalllr_newdata
like
0
Text Generation
Transformers
Safetensors
Dongwei/Math_8K_for_GRPO
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen-2.5-7B_Base_Math_smalllr_newdata
Commit History
End of training
e6217d8
verified
Dongwei
commited on
Feb 13
Model save
7c36af4
verified
Dongwei
commited on
Feb 13
End of training
6692eb1
verified
Dongwei
commited on
Feb 12
Model save
345b40e
verified
Dongwei
commited on
Feb 12
initial commit
cdf5c76
verified
Dongwei
commited on
Feb 11