Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ubermenchh
/
llama3.1-8B-gsm8k-grpo
like
0
PyTorch
Safetensors
GGUF
llama
unsloth
trl
grpo
conversational
License:
mit
Model card
Files
Files and versions
Community
Deploy
Use this model
main
llama3.1-8B-gsm8k-grpo
/
adapter_model.safetensors
Commit History
Trained with Unsloth
cb3df82
verified
ubermenchh
commited on
Feb 13