ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows Paper • 2505.19897 • Published May 26 • 102
On Path to Multimodal Generalist: General-Level and General-Bench Paper • 2505.04620 • Published May 7 • 82
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published May 8 • 179
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published May 7 • 66
VLM papers for science Collection Collecting papers that help understand how well VLMs perform in tasks related to science • 6 items • Updated May 1
Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption Paper • 2504.20769 • Published Apr 29 • 3
VLM papers for science Collection Collecting papers that help understand how well VLMs perform in tasks related to science • 6 items • Updated May 1
UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation Paper • 2504.21336 • Published Apr 30 • 4
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published Apr 30 • 58
VLM papers for science Collection Collecting papers that help understand how well VLMs perform in tasks related to science • 6 items • Updated May 1