The Hallucination Tax of Reinforcement Finetuning Paper โข 2505.13988 โข Published May 20 โข 8 โข 2
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning Paper โข 2504.05520 โข Published Apr 7 โข 10 โข 2
Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base Paper โข 2503.23361 โข Published Mar 30 โข 6 โข 2