Create and launch a voice synthesis interface
Contains 3 state-of-the-art models
Animate an image using a driving video or pickle file
Transcribe audio to text in Eastern languages
Visualize camera simulations and E.T. datasets