Model and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025]
Federico Cocchi
fede97
AI & ML interests
Multimodal LLM - Computer Vision
Recent Activity
Updated a model about 1 hour ago: aimagelab/LLaVA_MORE-llama_3_1-8B-S2-siglip-finetuning
Updated a model about 1 hour ago: aimagelab/LLaVA_MORE-llama_3_1-8B-S2-siglip-pretrain
Updated a model about 1 hour ago: aimagelab/LLaVA_MORE-llama_3_1-8B-S2-finetuning
Organizations
Collections
5
models
None public yet
datasets
5
fede97/external_test_set_v1 • Viewer • Updated • 340 • 55
fede97/external_data_test_example_v3 • Updated • 5
fede97/external_data_test_example • Viewer • Updated • 410 • 89
fede97/external_data_test_example_v2 • Viewer • Updated • 410 • 111
fede97/dpo_demo • Viewer • Updated • 148k • 97