Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding Paper • 2505.16990 • Published 2 days ago • 17 • 2
Introducing Visual Perception Token into Multimodal Large Language Model Paper • 2502.17425 • Published Feb 24 • 15 • 2
Attention Prompting on Image for Large Vision-Language Models Paper • 2409.17143 • Published Sep 25, 2024 • 7 • 2