Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
mradermacher
/
Qwen2.5-1.5B-Code-GRPO-dense-reward-3k-GGUF
like
1
Transformers
GGUF
open-r1/verifiable-coding-problems-python
English
Generated from Trainer
open-r1
trl
grpo
conversational
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
mradermacher
commited on
6 days ago
Commit
f4250f3
·
verified
·
1 Parent(s):
67a9bf5
auto-patch README.md
Browse files
Files changed (0)
hide
show