RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation Paper • 2605.13542 • Published 3 days ago • 6
RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation Paper • 2605.13542 • Published 3 days ago • 6
Solving Physics Olympiad via Reinforcement Learning on Physics Simulators Paper • 2604.11805 • Published Apr 13 • 16
MedOpenClaw: Auditable Medical Imaging Agents Reasoning over Uncurated Full Studies Paper • 2603.24649 • Published Mar 25 • 31
MedOpenClaw: Auditable Medical Imaging Agents Reasoning over Uncurated Full Studies Paper • 2603.24649 • Published Mar 25 • 31
Ariadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries Paper • 2511.00710 • Published Nov 1, 2025 • 5
M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision Paper • 2509.01360 • Published Sep 1, 2025 • 12
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning Paper • 2506.01713 • Published Jun 2, 2025 • 48
Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL Paper • 2505.17952 • Published May 23, 2025 • 20
Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL Paper • 2505.17952 • Published May 23, 2025 • 20
NOVA: A Benchmark for Anomaly Localization and Clinical Reasoning in Brain MRI Paper • 2505.14064 • Published May 20, 2025 • 19