Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages Paper • 2308.12038 • Published Aug 23, 2023 • 2
A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs Paper • 2411.17265 • Published Nov 26, 2024 • 1
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 25 days ago • 32
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published May 28 • 125
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published May 28 • 125
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published May 28 • 125
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond Paper • 2503.21614 • Published Mar 27 • 42
Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models Paper • 2503.11224 • Published Mar 14 • 29