grpo_lora_model_qwen2.5-0.5b-it_full / adapter_model.safetensors

Commit History

Upload model trained with Unsloth
680a63f
verified

barca-boy commited on