Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks • Paper • 2504.01308 • Published 2 days ago • 11
PaperBench: Evaluating AI's Ability to Replicate AI Research • Paper • 2504.01848 • Published 1 day ago • 20
Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features • Paper • 2504.00557 • Published 3 days ago • 12
Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models • Paper • 2503.22165 • Published 7 days ago • 20
On Large Multimodal Models as Open-World Image Classifiers • Paper • 2503.21851 • Published 7 days ago • 4
MedAgent-Pro: Towards Multi-modal Evidence-based Medical Diagnosis via Reasoning Agentic Workflow • Paper • 2503.18968 • Published 14 days ago • 5
Attention IoU: Examining Biases in CelebA using Attention Maps • Paper • 2503.19846 • Published 9 days ago • 7
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders • Paper • 2503.18878 • Published 10 days ago • 110
When Less is Enough: Adaptive Token Reduction for Efficient Image Representation • Paper • 2503.16660 • Published 14 days ago • 70
Where do Large Vision-Language Models Look at when Answering Questions? • Paper • 2503.13891 • Published 17 days ago • 8
CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners • Paper • 2503.16356 • Published 14 days ago • 15