Edit model card

Wav2Vec2ForSpeechClassification

Fine-tuned model based on https://huggingface.co/r-f/wav2vec-english-speech-emotion-recognition. The model is trained on the full TESS, RAVDESS, and CREMA datasets. The Github repository is found here

Authored by:

Datasets

Performance Metrics

  • Test accuracy: 85%
  • Test loss: 0.4458

Overview

The model is based on Wav2Vec2 and fine-tuned on the aforementioned datasets.

Downloads last month
7
Safetensors
Model size
316M params
Tensor type
F32
Β·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Spaces using kvilla/wav2vec-english-speech-emotion-recognition-finetuned 2