🧙 Guru - a koalazf99 Collection

koalazf99 's Collections

🐙 OctoThinker

🫐 ProX Projects

🧙 Guru

updated Jun 20

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49
LLM360/guru-RL-92k

Viewer • Updated 6 days ago • 91.9k • 860 • 17
LLM360/guru-7B

Text Generation • 8B • Updated Jun 19 • 1.14k • • 1
LLM360/guru-32B

Text Generation • 33B • Updated Jun 19 • 16