Update README.md
Browse files
README.md
CHANGED
@@ -16,8 +16,9 @@ pipeline_tag: audio-classification
|
|
16 |
# Model Description
|
17 |
This model includes the implementation of categorical emotion classification described in Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits (https://arxiv.org/pdf/2505.14648)
|
18 |
|
19 |
-
The training pipeline used is also the top-performing solution (SAILER) in INTERSPEECH 2025—Speech Emotion Challenge (https://lab-msp.com/MSP-Podcast_Competition/IS2025/).
|
20 |
-
|
|
|
21 |
|
22 |
|
23 |
The included emotions are:
|
|
|
16 |
# Model Description
|
17 |
This model includes the implementation of categorical emotion classification described in Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits (https://arxiv.org/pdf/2505.14648)
|
18 |
|
19 |
+
The training pipeline used is also the top-performing solution (SAILER) in INTERSPEECH 2025—Speech Emotion Challenge (https://lab-msp.com/MSP-Podcast_Competition/IS2025/).
|
20 |
+
Note that we did not use all the augmentation and did not use the transcript compared to our official challenge submission system, but we created a speech-only system to make the model simple but still effective.
|
21 |
+
We use the MSP-Podcast data to train this model, noting that the model might be sensitive to content information when making emotion predictions.
|
22 |
|
23 |
|
24 |
The included emotions are:
|