update README
- .gitattributes +1 -0
- README.md +1 -2
- figures/{VibeVoice.jpg → Fig1.png} +2 -2
- figures/MOS-preference.png +0 -0
.gitattributes
CHANGED
@@ -34,3 +34,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 *.jpg filter=lfs diff=lfs merge=lfs -text
+*.png filter=lfs diff=lfs merge=lfs -text
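The added rule routes PNG files through Git LFS, which the renamed figure (`Fig1.png`) depends on. As a rough illustration only, the sketch below approximates which paths the patterns in this hunk cover. It assumes that slash-free gitattributes patterns can be matched against the file's basename with shell-style globbing; real gitattributes matching has more rules than this. The helper name `is_lfs_tracked` is hypothetical and not part of the repository.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# Patterns copied from the .gitattributes hunk (state after this commit).
LFS_PATTERNS = ["*.zst", "*tfevents*", "*.jpg", "*.png"]


def is_lfs_tracked(path, patterns=LFS_PATTERNS):
    """Approximate check: match slash-free patterns against the basename.

    Hypothetical helper for illustration; Git itself applies additional
    rules (directory patterns, negation, precedence) not modeled here.
    """
    name = PurePosixPath(path).name
    return any(fnmatchcase(name, pattern) for pattern in patterns)


if __name__ == "__main__":
    for candidate in ["figures/Fig1.png", "figures/VibeVoice.jpg", "README.md"]:
        print(candidate, is_lfs_tracked(candidate))
```

Passing the three pre-commit patterns (without `*.png`) as the second argument shows why the rule was needed: under them, `figures/Fig1.png` would not be matched.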
README.md
CHANGED
@@ -23,8 +23,7 @@ The model can synthesize speech up to **90 minutes** long with up to **4 distinc
 ➡️ **Code:** [microsoft/VibeVoice-Code](https://github.com/microsoft/VibeVoice)
 
 <p align="left">
-<img src="figures/
-<!-- <img src="figures/MOS-preference.png" alt="MOS Preference Results" height="260px"> -->
+<img src="figures/Fig1.png" alt="VibeVoice Overview" height="250px">
 </p>
 
 ## Training details
figures/{VibeVoice.jpg → Fig1.png}
RENAMED
File without changes
figures/MOS-preference.png
DELETED
Binary file (67.2 kB)