view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 5 days ago • 25
Chain Of Thought Reasoning Collection These models have been finetuned to perform reasoning, chain of thought. • 6 items • Updated 25 days ago