Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty Paper • 2507.16806 • Published 15 days ago • 6