Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models Paper โข 2505.14810 โข Published May 20 โข 61
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification Paper โข 2505.16938 โข Published May 22 โข 118