--- base_model: unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit library_name: peft license: apache-2.0 pipeline_tag: text-generation language: - en tags: - unsloth - grpo - trl - transformers - qwen2.5 - text-generation-inference - PyTorch - gsm8k --- ### Model Description - **Developed by:** Jeesan Abbas - **License:** Apache license 2.0 - **Finetuned from model:** unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit # Uploaded model This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library. [](https://github.com/unslothai/unsloth)