Qwen2.5-3B-Instruct-GRPO-unsloth / model.safetensors.index.json

Commit History

Trained with Unsloth
e364b56
verified

rasdani committed on