NarrativeTrack: Evaluating Video Language Models Beyond the Frame Paper • 2601.01095 • Published 4 days ago • 6
SYNTHIA: Novel Concept Design with Affordance Composition Paper • 2502.17793 • Published Feb 25, 2025 • 1
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks Paper • 2502.17832 • Published Feb 25, 2025 • 6
Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning Paper • 2510.27623 • Published Oct 31, 2025 • 12
EpiCache: Episodic KV Cache Management for Long Conversational Question Answering Paper • 2509.17396 • Published Sep 22, 2025 • 19
UserBench: An Interactive Gym Environment for User-Centric Agents Paper • 2507.22034 • Published Jul 29, 2025 • 29
Perception-Aware Policy Optimization for Multimodal Reasoning Paper • 2507.06448 • Published Jul 8, 2025 • 47