Getting it Right: Improving Spatial Consistency in Text-to-Image Models Paper • 2404.01197 • Published Apr 1, 2024 • 32
LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models Paper • 2404.03118 • Published Apr 3, 2024 • 27
FastRM: An efficient and automatic explainability framework for multimodal generative models Paper • 2412.01487 • Published Dec 2, 2024 • 1