ijik-loker committed · Commit 9919215 · 1 Parent(s): 4e302d5

Add README.md

Files changed (1)
  1. README.md +76 -0
README.md ADDED
@@ -0,0 +1,76 @@
+ ---
+ language:
+ - en
+ tags:
+ - rvc
+ - voice cloning
+ - The Amazing World of Gumball
+ - Gumball Watterson
+ - Jacob Hopkins
+ ---
+
+ ## Model Details
+
+ RVC voice model of Jacob Hopkins as Gumball Watterson, trained on Season 3 of the cartoon The Amazing World of Gumball.
+
+ ### Model Description
+
+ <!-- Provide a longer summary of what this model is. -->
+
+ - **Developed by:** [ijik-loker](https://huggingface.co/ijik-loker)
+ - **Model type:** [Retrieval-based Voice Conversion (RVC)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
+ - **Language(s):** English
+
+ ## Uses
+
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+
+ Use this model for offline inference in the Retrieval-based Voice Conversion WebUI, or for real-time conversion with [Voice Changer](https://github.com/w-okada/voice-changer).
+ Load the index file alongside the model.
+
+ ## Training Details
+
+ ### Training Data
+
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+
+ #### Voice clips dataset total duration
+
+ v1 model: 11min 18s
+
+ v2 model: 26min 50s
+
+ Trained using these episodes from Season 3:
+ 1. The Boss
+ 2. The Move
+ 3. The Burden
+ 4. The Bros
+ 5. The Countdown
+ 6. The Nobody
+ 7. The Fraud
+ 8. The Void
+ 9. The Name
+ 10. The Extras (1 line)
+ 11. The Oracle
+ 12. The Safety
+ 13. The Procrastinators
+ 14. The Puppy
+ 15. The Recipe
+ 16. The Society
+ 17. The Spoiler
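
The dataset durations listed above can be checked by summing the lengths of the exported clips. The following is a minimal sketch using only the Python standard library; it assumes the clips are uncompressed PCM `.wav` files in a single folder, and the function names are illustrative rather than part of RVC or of the author's workflow.

```python
import wave
from pathlib import Path


def clip_seconds(path):
    """Return the duration of one PCM .wav file in seconds."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def total_seconds(folder):
    """Sum the durations of every .wav file directly inside `folder`."""
    return sum(clip_seconds(p) for p in sorted(Path(folder).glob("*.wav")))


def format_duration(seconds):
    """Format a number of seconds in the 'Xmin Ys' style used in this card."""
    minutes, secs = divmod(round(seconds), 60)
    return f"{minutes}min {secs}s"
```

For example, `format_duration(total_seconds("dataset/"))` would print the total for a hypothetical `dataset/` folder of clips.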
+
+ ### Training Procedure
+
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+
+ 1. Remove noise using [Ultimate Vocal Remover 5](https://github.com/Anjok07/ultimatevocalremovergui) UVR-DeNoise.
+ 2. Extract vocals using the RVC Web UI model [HP5-主旋律人声vocals+其他instrumentals.pth](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/uvr5_weights/HP5-%E4%B8%BB%E6%97%8B%E5%BE%8B%E4%BA%BA%E5%A3%B0vocals%2B%E5%85%B6%E4%BB%96instrumentals.pth).
+ 3. Remove echo and reverb using Ultimate Vocal Remover 5 UVR-DeEcho-DeReverb.
+ 4. Manually diarise voices in [Audacity](https://www.audacityteam.org/) using labels.
+ 5. Export the labelled regions to separate .wav files (Export Multiple, split by labels).
+ 6. Train using RVC with these settings:
+    * Target Sample Rate: 48k
+    * Version: v2
+    * Total training epochs: 200
+    * Base model G: f0G48k.pth
+    * Base model D: f0D48k.pth
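
Steps 4 and 5 above are done by hand in Audacity, but the export step can also be scripted. The sketch below is an assumption-laden illustration, not the author's method: it assumes the label track was exported to a text file in Audacity's tab-separated `start<TAB>end<TAB>label` format, that the source audio is an uncompressed PCM `.wav`, and that the speaker's regions all carry the same label text (here a hypothetical `gumball`). The function names are illustrative.

```python
import wave
from pathlib import Path


def read_labels(label_file):
    """Parse an Audacity label export: start<TAB>end<TAB>label per line."""
    labels = []
    for line in Path(label_file).read_text(encoding="utf-8").splitlines():
        parts = line.split("\t")
        if len(parts) < 3 or line.startswith("\\"):
            continue  # skip blank lines and spectral-range rows
        labels.append((float(parts[0]), float(parts[1]), parts[2].strip()))
    return labels


def export_by_labels(source_wav, label_file, out_dir, speaker):
    """Write one .wav per labelled region whose label equals `speaker`."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with wave.open(str(source_wav), "rb") as src:
        params = src.getparams()
        rate = src.getframerate()
        for i, (start, end, name) in enumerate(read_labels(label_file)):
            if name != speaker:
                continue
            # Seek to the region start and read exactly its length in frames.
            src.setpos(int(start * rate))
            frames = src.readframes(int((end - start) * rate))
            target = out_dir / f"{speaker}_{i:04d}.wav"
            with wave.open(str(target), "wb") as dst:
                dst.setparams(params)  # frame count is corrected on close
                dst.writeframes(frames)
            written.append(target)
    return written
```

A call such as `export_by_labels("episode.wav", "labels.txt", "clips", "gumball")` would write one clip per matching region into a hypothetical `clips/` folder, ready to be used as an RVC training dataset.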