ijik-loker's picture
Add README.md
35879fa
---
language:
- en
tags:
- rvc
- voice cloning
- The Amazing World of Gumball
- Anais Watterson
- Kyla Rae Kowalewski
---
## Model Details
Voice of Kyla Rae Kowalewski as Anais Watterson in Season 3 of the cartoon The Amazing World of Gumball.
### Model Description
<!-- Provide a longer summary of what this model is. -->
- **Developed by:** [ijik-loker](https://huggingface.co/ijik-loker)
- **Model type:** [Retrieval-based Voice Conversion (RVC)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
- **Language(s):** English
## Uses
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
Used in the popular Retrieval-based Voice Conversion WebUI via inference or real-time using [Voice Changer](https://github.com/w-okada/voice-changer).
The index file should be used alongside the model.
## Training Details
### Training Data
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
#### Voice clips dataset total duration
v1 model: 03min 56s
Trained using these episodes from Season 3:
1. The Nobody
2. The Name
3. The Oracle
4. The Safety
5. The Procrastinators
6. The Puppy
7. The Spoiler
### Training Procedure
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
1. Remove noise using [Ultimate Vocal Remover 5](https://github.com/Anjok07/ultimatevocalremovergui) UVR-DeNoise.
2. Extract vocals using RVC Web UI [HP5-主旋律人声vocals+其他instrumentals.pth](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/uvr5_weights/HP5-%E4%B8%BB%E6%97%8B%E5%BE%8B%E4%BA%BA%E5%A3%B0vocals%2B%E5%85%B6%E4%BB%96instrumentals.pth).
3. Remove echo and reverb using Ultimate Vocal Remover 5 UVR-DeEcho-DeReverb.
4. Manually diarise voices in [Audacity](https://www.audacityteam.org/) using labels.
5. Export multiple to .wav by labels.
6. Train using RVC
* Target Sample Rate: 48k
* Version: v2
* Total training epochs: 200
* Base model G: f0G48k.pth
* Base model D: f0D48k.pth