SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning Paper • 2506.21355 • Published 7 days ago • 7
MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning Paper • 2506.22992 • Published 5 days ago • 11
Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs Paper • 2405.18740 • Published May 29, 2024
Almanac Copilot: Towards Autonomous Electronic Health Record Navigation Paper • 2405.07896 • Published Apr 30, 2024
AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments Paper • 2405.07960 • Published May 13, 2024 • 1
MIRIAD: Augmenting LLMs with millions of medical query-response pairs Paper • 2506.06091 • Published 27 days ago • 8
Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards Paper • 2506.11474 • Published 21 days ago • 17
Predicting sepsis in multi-site, multi-national intensive care cohorts using deep learning Paper • 2107.05230 • Published Jul 12, 2021
Almanac: Retrieval-Augmented Language Models for Clinical Medicine Paper • 2303.01229 • Published Mar 1, 2023 • 1