Interact with images and texts using Qwen-VL-Max
Generate chat responses from user input
Generate images from text descriptions