- Trained on a mixture of 2.1k examples, most of which were manually gathered and verified.
- **The model thinks for much shorter than QwQ and R1 models, thanks to a brief but high-quality SFT dataset.**
- System messages to trigger think tags (which should be placed under the `user` role):
  - "You are an expert assistant. Think using `<think>` tags."
  - "You are a thinking assistant."

Example:

```
user
You are a thinking assistant. (1,234)² means 1,234 * 1,234; (1,234)³ means 1,234 * 1,234 * 1,234; and so forth. When (1,234)²³ is completely multiplied out, what will the number be in the ones place?
...
```

The model **can** sometimes miss the `<think>` tag. You can add the tag to the chat template, or try again with a different seed.

The model can also think about an image:

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6324eabf05bd8a54c6eb1650/mWJ9WmKzmBD1mEXiUaE_U.png)

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6324eabf05bd8a54c6eb1650/d-VGBpkWZLSLMNpSonOY3.png)
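The "try again with a different seed" advice can be sketched as a small retry loop. This is only an illustration, not the author's code: `generate` is a hypothetical callable standing in for whatever inference backend you use, `build_user_message` and `generate_with_think` are made-up helper names, and the literal `<think>` string is an assumption about what the think tag looks like in the model's output.

```python
from typing import Callable, Optional

# One of the trigger phrases from this card; it goes into the user turn.
THINK_PROMPT = "You are a thinking assistant."
# Assumption: the think tag appears literally as "<think>" in the reply.
THINK_TAG = "<think>"


def build_user_message(question: str, think_prompt: str = THINK_PROMPT) -> str:
    """Prepend the thinking trigger to the text of the user turn."""
    return f"{think_prompt} {question}"


def generate_with_think(
    generate: Callable[[str, int], str],
    question: str,
    max_tries: int = 3,
    base_seed: int = 0,
) -> Optional[str]:
    """Call `generate(prompt, seed)` with successive seeds until the reply
    contains the think tag. Returns None if every attempt misses it."""
    prompt = build_user_message(question)
    for attempt in range(max_tries):
        reply = generate(prompt, base_seed + attempt)
        if THINK_TAG in reply:
            return reply
    return None


if __name__ == "__main__":
    # Stand-in backend for demonstration only: it "forgets" the tag on seed 0.
    def fake_generate(prompt: str, seed: int) -> str:
        return "answer only" if seed == 0 else "<think>reasoning</think> answer"

    print(generate_with_think(fake_generate, "What is the ones digit of 4**3?"))
```

To use it with a real backend, wrap your own generation call in a function that takes the prompt text and a seed and returns the decoded reply.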