Think Only When You Need with Large Hybrid-Reasoning Models Paper • 2505.14631 • Published 4 days ago • 18
Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning Paper • 2505.13866 • Published 4 days ago • 14
Improving Assembly Code Performance with Large Language Models via Reinforcement Learning Paper • 2505.11480 • Published 8 days ago • 7
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning Paper • 2505.11896 • Published 7 days ago • 54
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Paper • 2505.10554 • Published 9 days ago • 113
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning Paper • 2505.01441 • Published 26 days ago • 36
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published 24 days ago • 53