mradermacher/prem-1B-grpo-GGUF
Tags: Reinforcement Learning, Transformers, GGUF, openai/gsm8k, English, math, reasoning, grpo, gsm8k, conversational
License: apache-2.0
main
1 contributor
History: 4 commits
Latest commit: mradermacher, "auto-patch README.md" (2fba8cc, verified), 4 months ago
File                       Scan   Size      Storage   Last commit            Updated
.gitattributes             Safe   2.24 kB   -         uploaded from rain     4 months ago
README.md                  Safe   3.31 kB   -         auto-patch README.md   4 months ago
prem-1B-grpo.IQ4_XS.gguf   Safe   610 MB    xet       uploaded from rain     4 months ago
prem-1B-grpo.Q2_K.gguf     Safe   432 MB    xet       uploaded from rain     4 months ago
prem-1B-grpo.Q3_K_L.gguf   Safe   592 MB    xet       uploaded from rain     4 months ago
prem-1B-grpo.Q3_K_M.gguf   Safe   548 MB    xet       uploaded from rain     4 months ago
prem-1B-grpo.Q3_K_S.gguf   Safe   499 MB    xet       uploaded from rain     4 months ago
prem-1B-grpo.Q4_K_M.gguf   Safe   668 MB    xet       uploaded from rain     4 months ago
prem-1B-grpo.Q4_K_S.gguf   Safe   640 MB    xet       uploaded from rain     4 months ago
prem-1B-grpo.Q5_K_M.gguf   Safe   782 MB    xet       uploaded from rain     4 months ago
prem-1B-grpo.Q5_K_S.gguf   Safe   766 MB    xet       uploaded from rain     4 months ago
prem-1B-grpo.Q6_K.gguf     Safe   903 MB    xet       uploaded from rain     4 months ago
prem-1B-grpo.Q8_0.gguf     Safe   1.17 GB   xet       uploaded from rain     4 months ago
prem-1B-grpo.f16.gguf      Safe   2.2 GB    xet       uploaded from rain     4 months ago
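The listed file sizes can help when choosing which quantization to download. The sketch below is only an illustration: it assumes file size is a rough proxy for fidelity (larger usually means less aggressive quantization) and treats 1 GB as 1000 MB. The names `QUANT_SIZES_MB` and `pick_quant` are hypothetical helpers, not part of any library, and on-disk size is not the same as runtime memory use.

```python
# File sizes copied from the listing above, in MB.
# Assumption: 1 GB is treated as 1000 MB for the two largest files.
QUANT_SIZES_MB = {
    "prem-1B-grpo.Q2_K.gguf": 432,
    "prem-1B-grpo.Q3_K_S.gguf": 499,
    "prem-1B-grpo.Q3_K_M.gguf": 548,
    "prem-1B-grpo.Q3_K_L.gguf": 592,
    "prem-1B-grpo.IQ4_XS.gguf": 610,
    "prem-1B-grpo.Q4_K_S.gguf": 640,
    "prem-1B-grpo.Q4_K_M.gguf": 668,
    "prem-1B-grpo.Q5_K_S.gguf": 766,
    "prem-1B-grpo.Q5_K_M.gguf": 782,
    "prem-1B-grpo.Q6_K.gguf": 903,
    "prem-1B-grpo.Q8_0.gguf": 1170,
    "prem-1B-grpo.f16.gguf": 2200,
}


def pick_quant(budget_mb):
    """Return the largest listed file that fits within budget_mb, or None.

    Hypothetical helper: uses download size only and ignores context length,
    KV-cache overhead, and other runtime memory costs.
    """
    fitting = {name: size for name, size in QUANT_SIZES_MB.items()
               if size <= budget_mb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


if __name__ == "__main__":
    # Example: choose a file for a 700 MB download budget.
    print(pick_quant(700))
```

Once a filename is chosen, one common way to fetch it is `hf_hub_download(repo_id="mradermacher/prem-1B-grpo-GGUF", filename=...)` from the `huggingface_hub` package, assuming that package is installed.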