---
license: mit
datasets:
- Wenetspeech4TTS/WenetSpeech4TTS
language:
- zh
pipeline_tag: text-to-speech
---
## Vanilla NaturalSpeech2 trained on WenetSpeech4TTS using the Amphion toolkit
Training follows Amphion's NaturalSpeech2 training code, except that the text-to-phoneme feature extraction step and the text encoder differ slightly.
### Checkpoints
- **ns2_base_model.bin**: NaturalSpeech2 trained on the WenetSpeech4TTS Basic subset
- **ns2_38_model.bin**: the Basic model fine-tuned on the WenetSpeech4TTS Standard subset
- **ns2_4_model.bin**: the Standard model fine-tuned on the WenetSpeech4TTS Premium subset
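
The three checkpoints appear to form a staged chain: Basic, then Standard, then Premium. As a rough sketch of that lineage, the lookup table below records which file each checkpoint was fine-tuned from. The file names and subsets come from the list above; the `CHECKPOINTS` layout and the `training_lineage` helper are hypothetical and are not part of Amphion.

```python
# Hypothetical helper: file names and subsets are taken from the checkpoint
# list; the dict layout and function name are illustrative assumptions.
CHECKPOINTS = {
    "ns2_base_model.bin": {"subset": "Basic", "init_from": None},
    "ns2_38_model.bin": {"subset": "Standard", "init_from": "ns2_base_model.bin"},
    "ns2_4_model.bin": {"subset": "Premium", "init_from": "ns2_38_model.bin"},
}


def training_lineage(filename):
    """Return the WenetSpeech4TTS subsets a checkpoint has seen, in training order."""
    stages = []
    current = filename
    # Walk back through the fine-tuning chain until the from-scratch model.
    while current is not None:
        entry = CHECKPOINTS[current]
        stages.append(entry["subset"])
        current = entry["init_from"]
    return list(reversed(stages))


print(training_lineage("ns2_4_model.bin"))
```

Under this reading, `ns2_4_model.bin` has been exposed to all three subsets, while `ns2_base_model.bin` has only seen the Basic subset.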
### Usage
Coming soon.