Making Large Multimodal Models Understand Arbitrary Visual Prompts Paper • 2312.00784 • Published Dec 1, 2023 • 2
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want Paper • 2403.20271 • Published Mar 29 • 2
Split & Merge: Unlocking the Potential of Visual Adapters via Sparse Training Paper • 2312.02923 • Published Dec 5, 2023 • 1