Open-R1: a fully open reproduction of DeepSeek-R1
β’
221
Hi @bojan2501 thanks, we will indeed be working hard to make sure this training recipe can work for small language models on consumer hardware since not everyone has a cluster of H100s at home :)
The tool we used for the images was Excalidraw! https://excalidraw.com