Speaker diarization

Relies on pyannote.audio 2.0 currently in development: see installation instructions.

from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained("AMITKESARI2000/pyannote_SD1")
output = pipeline("audio.wav")
for turn, _, speaker in output.itertracks(yield_label=True):
    # speaker speaks between turn.start and turn.end
    ...

Benchmark

Downloads last month
2
Inference API
Unable to determine this model's library. Check the docs .

Dataset used to train AMITKESARI2000/pyannote_SD1