Teaching Language Models to Critique via Reinforcement Learning Paper • 2502.03492 • Published 22 days ago • 23
NatureLM: Deciphering the Language of Nature for Scientific Discovery Paper • 2502.07527 • Published 16 days ago • 18
MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents Paper • 2502.05957 • Published 17 days ago • 16
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 333