File size: 548 Bytes
24fa937 dfb2537 d9fa9ae b8230cf dfb2537 0938bb3 dfb2537 db11771 dfb2537 ee1931e |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 |
---
license: cc-by-4.0
---
```
pip install masked_prosody_model
pip install git+https://github.com/minixc/srmrpy
```
torch and torchaudio need to be installed as well.
```python
from masked_prosody_model import MaskedProsodyModel
model = MaskedProsodyModel.from_pretrained("cdminix/masked_prosody_model")
representation = model.process_audio("some_audio.wav", layer=7) # layer between 0 and 15, 7 was used in the paper
```
This model was trained using Cloud TPUs supplied by Google’s TPU Research Cloud (TRC). I thank them for their support.
|