Qwen2.5-3B-Instruct-emoji-reasoning-gsm8k-lora / Qwen2_5_(3B)_GRPO_emoji_hf.ipynb
nomadicsynth
Upload Qwen2_5_(3B)_GRPO_emoji_hf.ipynb
7208a57 verified
2.79 MB