Generate audio and SRT subtitles from text
Convert source voice to target voice
Generate singing vocals from music scores