Convert text to speech with adjustable voice settings
GPT 4o like bot.
Generate clickable coordinates on a screenshot