Qwen-2-5-7b-RTL-GRPO_LoRA / adapter_model.safetensors

Commit History

Upload model trained with Unsloth
d1ac694
verified

sonyashijin commited on