Update README.md
Browse files
README.md
CHANGED
@@ -21,9 +21,10 @@ pipeline_tag: text-generation
|
|
21 |

|
22 |
|
23 |
# Information
|
24 |
-
Advanced, high-quality and lite reasoning for a tiny size that you can run on your phone.
|
|
|
25 |
|
26 |
-
Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned
|
27 |
|
28 |
# Format
|
29 |
```
|
|
|
21 |

|
22 |
|
23 |
# Information
|
24 |
+
Advanced, high-quality and **lite** reasoning for a tiny size that you can run on your phone.
|
25 |
+
At original quality, it runs at ~300 tokens/second on a single a800 Nvidia GPU.
|
26 |
|
27 |
+
Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned on reasoning using our own private superthoughts instruct dataset & modified the tokenizer slightly, after the SFT fine tuning we used GRPO to further amplify it's mathematics & problem solving abilities.
|
28 |
|
29 |
# Format
|
30 |
```
|