Model and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025]
Federico Cocchi
fede97
AI & ML interests
Multimodal LLM - Computer Vision
Recent Activity
new activity
1 day ago
aimagelab/RAID:Add task category and link to paper
updated
a model
3 days ago
aimagelab/LLaVA-MORE-Minerva
published
a model
3 days ago
aimagelab/LLaVA-MORE-Minerva