Transformer Interpretability Beyond Attention Visualization Paper • 2012.09838 • Published Dec 17, 2020 • 1
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Paper • 2505.10554 • Published 9 days ago • 113
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published 17 days ago • 144
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 18 days ago • 159
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning Paper • 2505.03318 • Published 18 days ago • 91
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published 25 days ago • 91
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published Jan 4 • 99
Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 • 11 items • Updated 26 days ago • 81
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31 • 279
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published Feb 7 • 140
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning Paper • 2503.07572 • Published Mar 10 • 44