Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation Paper • 2507.10524 • Published 21 days ago • 64
Self-Training Elicits Concise Reasoning in Large Language Models Paper • 2502.20122 • Published Feb 27 • 4