mrfakename
/

SmolR1-SFT-Alpha

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

SmolR1-SFT-Alpha / README.md

mrfakename's picture

Update README.md

3f3a468 verified 24 days ago

|

history blame contribute delete

366 Bytes

	---
	license: apache-2.0
	datasets:
	- bespokelabs/Bespoke-Stratos-17k
	language:
	- en
	base_model:
	- HuggingFaceTB/SmolLM2-1.7B-Instruct
	pipeline_tag: text-generation
	library_name: transformers
	tags:
	- reasoning
	---
	# SmolR1-SFT

	Potential limitations:

	* Endless repetition
	* Mistakes in reasoning

	Prompt format: ChatML

	Trained using Hugging Face's Open-R1 framework.