About Performance Score (WER)

#1
by yas0005 - opened

Hello.

I'm trying to implement French ASR. We ran tests on the MLS dataset with the same model you uploaded ("bofenghuang/whisper-large-v3-french-distil-dec16"). According to the model card, a WER of 3.57 was achieved in this setting. However, when we ran the test, we measured 4.64, which is slightly worse than the reported score. In the training details, you mentioned there were quality issues with the dataset that you fixed.

  1. I'm wondering if there were also quality issue corrections or preprocessing for the test dataset?
  2. If so, would it be possible to share the actual dataset you used for testing?
  3. Additionally, I would like to ask if there were any preprocessing steps you applied to the results before measuring WER, and if so, could you share those details as well?

Thank you,

Hello,
Thank you for your interest!
We used the test set as is, without any corrections, but we ran normalization (lowercasing, punctuation removal, etc.) on both the predictions and the ground truths before computing WER. I can't remember whether the normalization was the one from Whisper or whether I added further normalization specific to French, but the same module was used for all evaluated models.
All normalized predictions/ground truths and per-utterance WER results can be found here.
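For anyone trying to reproduce the numbers, the normalize-then-score procedure described above can be sketched roughly as follows. This is only an illustration under assumptions: the `normalize` function below (lowercasing, punctuation stripping, whitespace collapsing) is a stand-in for whatever normalizer was actually used, and `wer` is a plain Levenshtein-based implementation rather than the author's evaluation module.

```python
import re
import string

def normalize(text: str) -> str:
    # Hypothetical normalizer: lowercase, strip punctuation, collapse
    # whitespace. The exact normalizer used (Whisper's or a French-specific
    # one) is not confirmed in the thread.
    text = text.lower()
    text = text.translate(str.maketrans("", "", string.punctuation))
    return re.sub(r"\s+", " ", text).strip()

def wer(reference: str, hypothesis: str) -> float:
    # Word error rate = word-level Levenshtein distance / reference length.
    ref, hyp = reference.split(), hypothesis.split()
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = d[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)
    return d[len(ref)][len(hyp)] / max(len(ref), 1)

# Without normalization these differ; after normalization the WER is 0.
ref = normalize("Bonjour, le monde !")
hyp = normalize("bonjour le monde")
print(wer(ref, hyp))  # 0.0
```

Whether normalization is applied (and which normalizer) can easily account for a gap like 3.57 vs. 4.64, since raw Whisper output keeps casing and punctuation that the references may not share.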
