mradermacher/Qwen2.5-1.5B-Code-GRPO-dense-reward-3k-GGUF
Tags: Transformers, GGUF, open-r1/verifiable-coding-problems-python, English, Generated from Trainer, open-r1, trl, grpo, conversational
main
Commit History
auto-patch README.md (f4250f3, verified), committed by mradermacher 6 days ago
auto-patch README.md (67a9bf5, verified), committed by mradermacher 6 days ago
uploaded from leia (3549ea4, verified), committed by mradermacher 6 days ago
uploaded from leia (cceb00c, verified), committed by mradermacher 6 days ago
initial commit (a488829, verified), committed by mradermacher 6 days ago