llama3.2-3b-it-24-game-8k-qwq-r64-ps-grpo-r32-V2 / model-00002-of-00002.safetensors

Commit History

(Trained with Unsloth)
2d9b2d9
verified

erdem-erdem commited on