Calculate model vRAM usage
Analyze image to generate descriptive prompt
Analyze text using tuned lens and visualize predictions