Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models Paper • 2505.16265 • Published 3 days ago • 6
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space Paper • 2505.15778 • Published 3 days ago • 10