Spaces:

open-r1
/

README

Running

App Files Files Community

SmolLm2-135 R1 Distill

by ewre324 - opened Jan 30

Discussion

ewre324

Jan 30

Hello, I just used SFT to produce an R1 distill.
https://huggingface.co/ewre324/ewre324-R1-SmolLM2-135M-Distill

Please use and comment if possible.

El-chapoo

Feb 4

i think the downside of thinking models is that even for simple question they may take alot of thinking tokens but i think we should have dataset to Train llms to figure out when to use thinking strategy and when to simply answer the question like regular llms do

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment