Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ubermenchh
/
llama3.1-8B-gsm8k-grpo
like
0
PyTorch
Safetensors
GGUF
llama
unsloth
trl
grpo
conversational
License:
mit
Model card
Files
Files and versions
Community
Deploy
Use this model
b4d6dde
llama3.1-8B-gsm8k-grpo
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
ubermenchh
Upload tokenizer
b4d6dde
verified
4 months ago
.gitattributes
Safe
1.57 kB
Upload tokenizer
4 months ago
README.md
Safe
24 Bytes
initial commit
4 months ago
special_tokens_map.json
Safe
454 Bytes
Upload tokenizer
4 months ago
tokenizer.json
Safe
17.2 MB
LFS
Upload tokenizer
4 months ago
tokenizer_config.json
Safe
55.5 kB
Upload tokenizer
4 months ago