Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning Paper • 2505.15966 • Published 3 days ago • 43 • 2
General-Reasoner: Advancing LLM Reasoning Across All Domains Paper • 2505.14652 • Published 4 days ago • 17 • 6
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning Paper • 2504.08837 • Published Apr 10 • 42 • 2