Llama-3.2-3B-Instruct-new-grpo-r32 / model.safetensors.index.json

Commit History

(Trained with Unsloth)
50a4e34
verified

erdem-erdem commited on