Explainable-Vision-Language-Model 🥶 (running on Zero): Generate a video visualizing a model's attention on an image while generating text.