Pendrokar/xvapitch_expresso

xVASynth's xVAPitch (v3) type of voice models based on the Expresso dataset. Without enunciated, laughing, whispering and singing styles. From the confused style, only questions were used.

These models can also do emphasis on words by using colons :, rather than the typical quotemarks " which are skipped by the xVASynth text pre-processor:

What :exactly: is it?
Well :normally: we just let it run.

ex01 male:

ex02 female:

ex03 male:

ex04 female: (These audio samples were created with the xVASynth Editor with the SR option (44kHz), not xVATrainer whose automatically created samples often sound different)

Legal note: Although these datasets are licensed as CC BY 4.0, the base v3 model that these models are fine-tuned from, was pre-trained on non-permissive data.

v3 base model: https://huggingface.co/Pendrokar/xvapitch

Pendrokar
/

xvapitch_expresso

Model tree for Pendrokar/xvapitch_expresso

Dataset used to train Pendrokar/xvapitch_expresso

Spaces using Pendrokar/xvapitch_expresso 2