MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
Paper
•
2502.18924
•
Published
•
12
TTS models that support zero-shot voice cloning
Note https://github.com/yl4579/StyleTTS-ZS (Official code not released yet, still under development)
Note Unofficial implementation: https://github.com/lucidrains/e2-tts-pytorch