Chat with an AI that understands images and text
Generate answers using a text-based model
Segment objects in images and videos using text prompts
Answer questions about images by chatting