rasdani's Collections
smolR1
Updated 14 days ago
Reproducing DeepSeek R1 Zero with Qwen2.5-0.5B on two 4090 GPUs
rasdani/smolR1-Qwen2.5-0.5B • Text Generation • Updated 14 days ago • 38
rasdani/simplerl_qwen_level1to4 • Viewer • Updated 16 days ago • 8.14k • 98