Detect objects in images and get bounding boxes
Convert your face photo into anime style
Transcribe audio from a microphone, file, or YouTube link
Generate personalized images with face preservation
Generate images from text descriptions
Generate edited images with prompts
Execute commands based on environment variables
Generate high-resolution images with text prompts
Perfect OCR VLM
Analyze an image to generate a descriptive prompt
Convert PDFs or images to Markdown with OCR and layout analysis
Generate corrected text using a reference
Generate images from text prompts with a specific style
CPU-powered, low-RTF, emotional, multilingual TTS