coqui
/

XTTS-v1

reubenm commited on Oct 25, 2023

Commit

c386dfb

•

1 Parent(s): b45432f

Model works best with 6 seconds of reference, not 3

Files changed (1) hide show

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ pipeline_tag: text-to-speech
 ---
 # ⓍTTS
-ⓍTTS is a Voice generation model that lets you clone voices into different languages by using just a quick 3-second audio clip. Built on Tortoise,
 ⓍTTS has important model changes that make cross-language voice cloning and multi-lingual speech generation super easy.
 There is no need for an excessive amount of training data that spans countless hours.
@@ -16,7 +16,7 @@ a few tricks to make it faster and support streaming inference.
 ### Features
 - Supports 14 languages.
-- Voice cloning with just a 3-second audio clip.
 - Emotion and style transfer by cloning.
 - Cross-language voice cloning.
 - Multi-lingual speech generation.

 ---
 # ⓍTTS
+ⓍTTS is a Voice generation model that lets you clone voices into different languages by using just a quick 6-second audio clip. Built on Tortoise,
 ⓍTTS has important model changes that make cross-language voice cloning and multi-lingual speech generation super easy.
 There is no need for an excessive amount of training data that spans countless hours.
 ### Features
 - Supports 14 languages.
+- Voice cloning with just a 6-second audio clip.
 - Emotion and style transfer by cloning.
 - Cross-language voice cloning.
 - Multi-lingual speech generation.