File size: 548 Bytes
24fa937
 
 
dfb2537
 
d9fa9ae
b8230cf
dfb2537
0938bb3
dfb2537
 
 
 
db11771
dfb2537
 
ee1931e
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
---
license: cc-by-4.0
---

```
pip install masked_prosody_model
pip install git+https://github.com/minixc/srmrpy
```
torch and torchaudio need to be installed as well.

```python
from masked_prosody_model import MaskedProsodyModel
model = MaskedProsodyModel.from_pretrained("cdminix/masked_prosody_model")
representation = model.process_audio("some_audio.wav", layer=7) # layer between 0 and 15, 7 was used in the paper
```

This model was trained using Cloud TPUs supplied by Google’s TPU Research Cloud (TRC). I thank them for their support.