llama3.2-3b-it-24-game-8k-qwq-r64-ps-grpo-r32-V2 / model-00001-of-00002.safetensors

Commit History

(Trained with Unsloth)
8fc96ce
verified

erdem-erdem committed on