padmalcom/wav2vec2-large-nonverbalvocalization-classification

Audio Classification


This language-independent wav2vec2 classification model is based on this dataset.

Sound classes are:

teeth-chattering
teeth-grinding
tongue-clicking
nose-blowing
coughing
yawning
throat clearing
sighing
lip-popping
lip-smacking
panting
crying
laughing
sneezing
moaning
screaming
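
As an illustration of how a classifier's raw output might be turned into one of the class names above, here is a minimal sketch in plain Python. It assumes the model emits one logit per class and that the index order matches the order of the list above; both are assumptions for illustration, and the authoritative label mapping is whatever the model's own configuration defines. The names `LABELS`, `softmax`, and `top_k` are hypothetical helpers, not part of the model repository.

```python
import math

# Class names copied from the list above.
# ASSUMPTION: the index order matches the model's output order. Check the
# model's own configuration for the real index-to-label mapping.
LABELS = [
    "teeth-chattering", "teeth-grinding", "tongue-clicking", "nose-blowing",
    "coughing", "yawning", "throat clearing", "sighing",
    "lip-popping", "lip-smacking", "panting", "crying",
    "laughing", "sneezing", "moaning", "screaming",
]


def softmax(logits):
    """Convert raw logits into probabilities that sum to 1."""
    highest = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - highest) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(logits, k=3, labels=LABELS):
    """Return the k most probable (label, probability) pairs, best first."""
    if len(logits) != len(labels):
        raise ValueError(
            f"expected {len(labels)} logits, got {len(logits)}"
        )
    probs = softmax(logits)
    ranked = sorted(zip(labels, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    # Made-up logits for demonstration only, not real model output.
    example_logits = [0.0] * len(LABELS)
    example_logits[4] = 5.0
    for label, prob in top_k(example_logits):
        print(f"{label}: {prob:.3f}")
```

Reporting the top few classes rather than a single winner can be useful here, since several of the classes (for example lip-popping and lip-smacking) are acoustically similar.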

inference.py shows how the model can be used.

