CapSpeech TTS
Stylized TTS: design voice, accent, and emotion your way
A Step Towards Music Generation Foundation Model
Generate an edited image based on text instructions
Generate audio from video or text prompts
Edit an image based on the given instruction
Insert images into backgrounds using masks or text labels
Generate realistic dialogue from a script, using Dia!
Official demo for OmniTalker
Generate high-quality images from text descriptions
Replace characters in a video with characters in photos
Generate audio and video from text prompts