RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling (arXiv:2506.08672, published Jun 10)
ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection (arXiv:2505.16475, published May 22)
Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models (arXiv:2405.02861, published May 5, 2024)