Collection of Distills using Open R1
asdf
ewre324
AI & ML interests
None yet
Recent Activity
upvoted
an
article
4 days ago
Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial
updated
a model
5 days ago
ewre324/ewre324-R1-Minueza-32M-Distill
updated
a collection
5 days ago
R1 Distill
Organizations
Collections
3
These models have been finetuned to perform reasoning, chain of thought.
-
ewre324/ewre324-Thinker-Llama-3.2-3B-Instruct-Reasoning
Updated • 256 -
ewre324/ewre324-Thinker-Qwen2.5-0.5B-Instruct-Reasoning
Updated • 28 -
ewre324/ewre324-Thinker-SmolLM2-135M-Instruct-Reasoning
Text Generation • Updated • 35 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper • 2201.11903 • Published • 9
models
8
ewre324/ewre324-R1-Minueza-32M-Distill
Updated
ewre324/ewre324-R1-SmolLM2-135M-Distill
Updated
•
16
ewre324/moondream2
Image-Text-to-Text
•
Updated
•
65
ewre324/ewre324-QwQ-0.5B-Distilled-SFT-Reason
Updated
•
7
ewre324/ewre324-Thinker-Llama-3.2-1B-Instruct-Reason
Updated
•
3
ewre324/ewre324-Thinker-Llama-3.2-3B-Instruct-Reasoning
Updated
•
256
ewre324/ewre324-Thinker-Qwen2.5-0.5B-Instruct-Reasoning
Updated
•
28
ewre324/ewre324-Thinker-SmolLM2-135M-Instruct-Reasoning
Text Generation
•
Updated
•
35