Transcribe live audio to text
Convert spoken words into text
Talk to Gemini using Google's multimodal API
Interact with an AI agent to perform web tasks