
aimagelab/ReflectiVA
Image-Text-to-Text
Model and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025]