Text-to-Speech
Vietnamese
vietnamese
female
male
voice-cloning
erax commited on
Commit
8497e8d
·
verified ·
1 Parent(s): 747e349

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -2
README.md CHANGED
@@ -36,8 +36,11 @@ tags:
36
 
37
  Hey there, fellow Vietnamese AI explorers! 👋
38
 
39
- We took the rather clever [F5-TTS model](https://arxiv.org/abs/2410.06885), with ++800,000 samples from public repository and from a huge 500h private dataset whom was kindly giving us a right to use it.
40
- We gave it a nudge with almost 1 millions steps update o 4xRTX3090 towards Vietnamese TTS, and sprinkled in some voice cloning capabilities because... well, why not? We're calling this little experiment **EraX-Smile-Female-F5-V1.0**. We hope it brings a smile (or at least doesn't make you frown *too* much).
 
 
 
41
 
42
  ## Does it actually work? Let's listen! 🎧
43
 
 
36
 
37
  Hey there, fellow Vietnamese AI explorers! 👋
38
 
39
+ We introduce **EraX-Smile-Female-F5-V1.0**, a Vietnamese text-to-speech model developed based on the F5-TTS architecture [arXiv:2410.06885](https://arxiv.org/abs/2410.06885).
40
+ To adapt this model for Vietnamese, we utilized a substantial dataset combining over 800,000 samples, of which some are from public repositories and with an extensive 500-hour private dataset, for which we gratefully acknowledge obtaining usage rights.
41
+ The model underwent significant training, involving approximately 1 million update steps on a 4x RTX 3090 configuration. It tooks almost a week with some crashes and burns too 🔥
42
+
43
+ Our hope is that EraX-Smile-Female-F5-V1.0 (soon UniSex) proves to be a useful contribution to the community for ethical and creative purposes.
44
 
45
  ## Does it actually work? Let's listen! 🎧
46