Qwen2.5-3B-Instruct-new-grpo-r32 / model.safetensors.index.json

Commit History

(Trained with Unsloth)
fbe8ed5
verified

erdem-erdem committed on