mRAG: Elucidating the Design Space of Multi-modal Retrieval-Augmented Generation Paper • 2505.24073 • Published May 29, 2025
Demystifying the Visual Quality Paradox in Multimodal Large Language Models Paper • 2506.15645 • Published Jun 18, 2025
SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems Paper • 2506.07564 • Published Jun 9, 2025
Generative AI for Autonomous Driving: Frontiers and Opportunities Paper • 2505.08854 • Published May 13, 2025
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving Paper • 2412.15208 • Published Dec 19, 2024
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving Paper • 2412.15206 • Published Dec 19, 2024
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization Paper • 2502.13146 • Published Feb 18, 2025