Pinkstack commited on
Commit
0b27704
·
verified ·
1 Parent(s): e63103e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -21,9 +21,10 @@ pipeline_tag: text-generation
21
  ![superthoughts lite](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/K5kYIHYj2aX2kB6MlcM9O.png)
22
 
23
  # Information
24
- Advanced, high-quality and lite reasoning for a tiny size that you can run on your phone.
 
25
 
26
- Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned in on reasoning & modified the tokenizer slightly, after the SFT fine tuning we used GRPO to further amplify it's mathematics & problem solving abilities.
27
 
28
  # Format
29
  ```
 
21
  ![superthoughts lite](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/K5kYIHYj2aX2kB6MlcM9O.png)
22
 
23
  # Information
24
+ Advanced, high-quality and **lite** reasoning for a tiny size that you can run on your phone.
25
+ At original quality, it runs at ~300 tokens/second on a single a800 Nvidia GPU.
26
 
27
+ Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned on reasoning using our own private superthoughts instruct dataset & modified the tokenizer slightly, after the SFT fine tuning we used GRPO to further amplify it's mathematics & problem solving abilities.
28
 
29
  # Format
30
  ```