Generate descriptive prompts from images
Generate high-resolution images from text prompts
Process and tokenize text input