koolkarni-Atharva10 commited on
Commit
cb6f061
Β·
verified Β·
1 Parent(s): b3150a4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -1,3 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # πŸš€ Fine-Tuning Qwen2.5-3B-Instruct with GRPO for GSM8K Dataset
2
 
3
  ## 🌟 Introduction
 
1
+ ---
2
+ tags:
3
+ - model
4
+ - fine-tuning
5
+ - reinforcement-learning
6
+ - qwen
7
+ - gsm8k
8
+ license: mit
9
+ language: en
10
+ library_name: transformers
11
+ datasets:
12
+ - gsm8k
13
+ ---
14
+
15
+
16
  # πŸš€ Fine-Tuning Qwen2.5-3B-Instruct with GRPO for GSM8K Dataset
17
 
18
  ## 🌟 Introduction