Do Audio-Visual Large Language Models Really See and Hear? Paper • 2604.02605 • Published 21 days ago • 7