---
language:
- en
tags:
- rvc
- voice cloning
- The Amazing World of Gumball
- Gumball Watterson
- Jacob Hopkins
---

## Model Details

An RVC voice model of Jacob Hopkins as Gumball Watterson in Season 3 of the cartoon *The Amazing World of Gumball*.

### Model Description

<!-- Provide a longer summary of what this model is. -->

- **Developed by:** [ijik-loker](https://huggingface.co/ijik-loker)
- **Model type:** [Retrieval-based Voice Conversion (RVC)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
- **Language(s):** English

## Uses

<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->

Use the model for offline inference in the Retrieval-based Voice Conversion WebUI, or for real-time conversion with [Voice Changer](https://github.com/w-okada/voice-changer).

Load the index file alongside the model.
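
Since the index file travels with the model, a small helper can confirm that both are present before loading them into a client. The sketch below is only an illustration and is not part of RVC or Voice Changer: `find_model_files` and the folder layout are hypothetical, and it assumes the weights use a `.pth` extension and the index a `.index` extension.

```python
from pathlib import Path
from typing import Optional, Tuple


def find_model_files(model_dir: str) -> Tuple[Path, Optional[Path]]:
    """Return the first .pth weights file and the first .index file in model_dir.

    Hypothetical helper: the index path is None when no .index file is present,
    so the caller can warn that conversion will run without retrieval.
    """
    folder = Path(model_dir)
    weights = sorted(folder.glob("*.pth"))
    if not weights:
        raise FileNotFoundError(f"no .pth file found in {folder}")
    indexes = sorted(folder.glob("*.index"))
    return weights[0], (indexes[0] if indexes else None)
```

A caller would pass the folder that holds the downloaded files and point the WebUI or Voice Changer at the two returned paths.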

## Training Details

### Training Data

<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->

#### Voice clips dataset total duration

- v1 model: 11min 18s
- v2 model: 26min 50s
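
Totals like these can be recomputed from a folder of exported clips. The following is a minimal sketch using only the Python standard library; the function names and the folder path are hypothetical, and it assumes the clips are PCM `.wav` files, which is what the `wave` module can read.

```python
import wave
from pathlib import Path


def total_duration_seconds(clip_dir: str) -> float:
    """Sum the duration of every .wav file directly inside clip_dir."""
    total = 0.0
    for path in sorted(Path(clip_dir).glob("*.wav")):
        with wave.open(str(path), "rb") as clip:
            # duration = number of frames / frames per second
            total += clip.getnframes() / clip.getframerate()
    return total


def format_duration(seconds: float) -> str:
    """Format seconds in the 'Mmin Ss' style used in this card."""
    minutes, secs = divmod(int(round(seconds)), 60)
    return f"{minutes}min {secs}s"
```

For reference, 11min 18s is 678 seconds and 26min 50s is 1610 seconds, so `format_duration(678)` gives `11min 18s`.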

The clips come from these Season 3 episodes:

1. The Boss
2. The Move
3. The Burden
4. The Bros
5. The Countdown
6. The Nobody
7. The Fraud
8. The Void
9. The Name
10. The Extras (1 line)
11. The Oracle
12. The Safety
13. The Procrastinators
14. The Puppy
15. The Recipe
16. The Society
17. The Spoiler

### Training Procedure

<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->

1. Remove noise using [Ultimate Vocal Remover 5](https://github.com/Anjok07/ultimatevocalremovergui) UVR-DeNoise.
2. Extract vocals using the RVC WebUI model [HP5-主旋律人声vocals+其他instrumentals.pth](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/uvr5_weights/HP5-%E4%B8%BB%E6%97%8B%E5%BE%8B%E4%BA%BA%E5%A3%B0vocals%2B%E5%85%B6%E4%BB%96instrumentals.pth).
3. Remove echo and reverb using Ultimate Vocal Remover 5 UVR-DeEcho-DeReverb.
4. Manually diarise voices in [Audacity](https://www.audacityteam.org/) using labels.
5. Export the labelled regions as separate .wav files (Audacity's Export Multiple, split by labels).
6. Train using RVC with these settings:
    * Target Sample Rate: 48k
    * Version: v2
    * Total training epochs: 200
    * Base model G: f0G48k.pth
    * Base model D: f0D48k.pth
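
Before step 6, it may be worth checking the exported clips against the 48k target, since a clip recorded at a lower rate carries less high-frequency detail than the target implies. This is a hedged sketch and not part of the procedure above: `clips_below_target` and the folder path are hypothetical, and it assumes PCM `.wav` exports readable by Python's `wave` module.

```python
import wave
from pathlib import Path
from typing import Dict

TARGET_SAMPLE_RATE = 48_000  # mirrors "Target Sample Rate: 48k" in the settings above


def clips_below_target(clip_dir: str, target: int = TARGET_SAMPLE_RATE) -> Dict[str, int]:
    """Map file name -> sample rate for every .wav clip whose rate is below target."""
    low: Dict[str, int] = {}
    for path in sorted(Path(clip_dir).glob("*.wav")):
        with wave.open(str(path), "rb") as clip:
            rate = clip.getframerate()
        if rate < target:
            low[path.name] = rate
    return low
```

An empty result means every clip in the folder is at or above the target rate.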

## Evaluation

<!-- This section describes the evaluation protocols and provides the results. -->

#### Summary

The v1 model seems to perform fine. The v2 model sounds coarse at times.