RL Reasoning - a dipperKW Collection

dipperKW 's Collections

RL Reasoning

updated about 11 hours ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published 1 day ago • 55
Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models

Paper • 2508.02120 • Published 4 days ago • 5