Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
yeok
/
DeepScaleR-1.5B-Preview-GSM8K-Demo
like
0
Safetensors
qwen2
unsloth
gsm8k
4-bit precision
bitsandbytes
License:
mit
Model card
Files
Files and versions
Community
yeok
commited on
Feb 17
Commit
0228ce2
·
verified
·
1 Parent(s):
57fa2a3
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+8
-5
README.md
CHANGED
Viewed
@@ -1,5 +1,8 @@
1
-
---
2
-
license: mit
3
-
tags:
4
-
- unsloth
5
-
---
1
+
---
2
+
license: mit
3
+
tags:
4
+
- unsloth
5
+
- gsm8k
6
+
---
7
+
8
+
Fine tuning experiment details at https://github.com/Yeok-c/grpo-gsm8k-demo