SmolR1-SFT-Alpha / README.md

mrfakename

Update README.md

3f3a468 verified 24 days ago

preview code

raw

history blame contribute delete

366 Bytes

metadata

license: apache-2.0
datasets:
  - bespokelabs/Bespoke-Stratos-17k
language:
  - en
base_model:
  - HuggingFaceTB/SmolLM2-1.7B-Instruct
pipeline_tag: text-generation
library_name: transformers
tags:
  - reasoning

SmolR1-SFT

Potential limitations:

Endless repetition
Mistakes in reasoning

Prompt format: ChatML

Trained using Hugging Face's Open-R1 framework.