Model works best with 6 seconds of reference, not 3
README.md
CHANGED
@@ -7,7 +7,7 @@ pipeline_tag: text-to-speech
 ---
 
 # ⓍTTS
-ⓍTTS is a Voice generation model that lets you clone voices into different languages by using just a quick
+ⓍTTS is a Voice generation model that lets you clone voices into different languages by using just a quick 6-second audio clip. Built on Tortoise,
 ⓍTTS has important model changes that make cross-language voice cloning and multi-lingual speech generation super easy.
 There is no need for an excessive amount of training data that spans countless hours.
 
@@ -16,7 +16,7 @@ a few tricks to make it faster and support streaming inference.
 
 ### Features
 - Supports 14 languages.
-- Voice cloning with just a
+- Voice cloning with just a 6-second audio clip.
 - Emotion and style transfer by cloning.
 - Cross-language voice cloning.
 - Multi-lingual speech generation.
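
Since the commit recommends about 6 seconds of reference audio, it can help to check a clip's length before using it for cloning. The following is a minimal sketch using only Python's standard `wave` module, assuming the reference clip is an uncompressed PCM WAV file. The names `MIN_REFERENCE_SECONDS`, `wav_duration_seconds`, and `is_long_enough` are hypothetical helpers for illustration and are not part of ⓍTTS; the 6-second value is taken from the commit title and is a recommendation, not a limit the model is known to enforce.

```python
import wave

# Recommended reference length from the commit title (an assumption here,
# not a hard limit enforced by the model).
MIN_REFERENCE_SECONDS = 6.0


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav_file:
        # Duration = number of audio frames / frames per second.
        return wav_file.getnframes() / wav_file.getframerate()


def is_long_enough(path, min_seconds=MIN_REFERENCE_SECONDS):
    """Return True if the clip lasts at least `min_seconds`."""
    return wav_duration_seconds(path) >= min_seconds
```

A caller could run `is_long_enough("reference.wav")` (with a hypothetical file name) and record a longer clip when it returns `False`.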