File size: 2,407 Bytes
aff9ca6
 
 
 
 
 
 
 
 
 
 
 
 
144b66d
aff9ca6
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
---
language:
- en
tags:
- rvc
- voice cloning
- The Amazing World of Gumball
- Darwin Watterson
- Terrell Ransom, Jr.
---

## Model Details

Voice of Terrell Ransom, Jr. as Darwin Watterson in Season 3 of the cartoon The Amazing World of Gumball.

### Model Description

<!-- Provide a longer summary of what this model is. -->

- **Developed by:** [ijik-loker](https://huggingface.co/ijik-loker)
- **Model type:** [Retrieval-based Voice Conversion (RVC)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
- **Language(s):** English

## Uses

<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->

Used in the popular Retrieval-based Voice Conversion WebUI via inference or real-time using [Voice Changer](https://github.com/w-okada/voice-changer).
The index file should be used alongside the model.

## Training Details

### Training Data

<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->

#### Voice clips dataset total duration

v1 model: 16min 14s

Trained using these episodes from Season 3:
1. The Boss
2. The Move
3. The Burden
4. The Bros
5. The Countdown
6. The Nobody
7. The Fraud
8. The Void
9. The Name
10. The Oracle
11. The Safety
12. The Procrastinators
13. The Puppy
14. The Recipe
15. The Society
16. The Spoiler

### Training Procedure 

<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->

1. Remove noise using [Ultimate Vocal Remover 5](https://github.com/Anjok07/ultimatevocalremovergui) UVR-DeNoise.
2. Extract vocals using RVC Web UI [HP5-主旋律人声vocals+其他instrumentals.pth](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/uvr5_weights/HP5-%E4%B8%BB%E6%97%8B%E5%BE%8B%E4%BA%BA%E5%A3%B0vocals%2B%E5%85%B6%E4%BB%96instrumentals.pth).
3. Remove echo and reverb using Ultimate Vocal Remover 5 UVR-DeEcho-DeReverb.
4. Manually diarise voices in [Audacity](https://www.audacityteam.org/) using labels.
5. Export multiple to .wav by labels.
6. Train using RVC
  * Target Sample Rate: 48k
  * Version: v2
  * Total training epochs: 200
  * Base model G: f0G48k.pth
  * Base model D: f0D48k.pth