Generate animated video between two images
Replace objects in images with new content
Generate depth map from image
Segment objects in images using prompts
Create 3D models from images
Generate audio from text using VITS
Generate anime character speech from text
Enhance and restore old photos with faces
Transcribe audio to text with speaker diarization