Generate text by combining an image and a question
Select coordinates on an image based on instructions
Upload documents and ask questions