Scaling LLM Inference with Optimized Sample Compute Allocation Paper • 2410.22480 • Published Oct 29, 2024
Test-time Computing: from System-1 Thinking to System-2 Thinking Paper • 2501.02497 • Published 8 days ago • 33
Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective Paper • 2412.14135 • Published 25 days ago
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published 4 days ago • 72