tiantiaf
/

whisper-large-v3-msp-podcast-emotion

Audio Classification

model_hub_mixin

pytorch_model_hub_mixin

speech_emotion_recognition

Model card Files Files and versions Community

tiantiaf commited on May 24

Commit

5c04ab5

·

verified ·

1 Parent(s): 5d5194e

Update README.md

Files changed (1) hide show

README.md +3 -2

README.md CHANGED Viewed

@@ -16,8 +16,9 @@ pipeline_tag: audio-classification
 # Model Description
 This model includes the implementation of categorical emotion classification described in Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits (https://arxiv.org/pdf/2505.14648)
-The training pipeline used is also the top-performing solution (SAILER) in INTERSPEECH 2025—Speech Emotion Challenge (https://lab-msp.com/MSP-Podcast_Competition/IS2025/). Note that we did not use all the augmentation and did not use the transcript to make the model simple but still effective.
-We use the MSP-Podcast data to train this model, noting that the model might be sensitive to content information in making the emotion prediction.
 The included emotions are:

 # Model Description
 This model includes the implementation of categorical emotion classification described in Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits (https://arxiv.org/pdf/2505.14648)
+The training pipeline used is also the top-performing solution (SAILER) in INTERSPEECH 2025—Speech Emotion Challenge (https://lab-msp.com/MSP-Podcast_Competition/IS2025/).
+Note that we did not use all the augmentation and did not use the transcript compared to our official challenge submission system, but we created a speech-only system to make the model simple but still effective.
+We use the MSP-Podcast data to train this model, noting that the model might be sensitive to content information when making emotion predictions.
 The included emotions are: