llama3.1-8B-gsm8k-grpo / adapter_model.safetensors

Commit History

Trained with Unsloth
cb3df82
verified

ubermenchh committed on