A GPT-4o-like bot.
Generate images from text descriptions
Transcribe audio from a microphone, files, or YouTube