Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
leosong
/
Qwen2.5-1.5B-GRDPO
like
0
PEFT
Safetensors
Model card
Files
Files and versions
Community
Use this model
项目页面
Framework versions
项目页面
https://github.com/leosongwei/GRDPO
Framework versions
PEFT 0.14.0
Downloads last month
2
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support