Papers
arxiv:2506.02894

A Multi-Dialectal Dataset for German Dialect ASR and Dialect-to-Standard Speech Translation

Published on Jun 3
Authors:
,
,

Abstract

Betthupferl evaluates the robustness of multilingual ASR models to dialectal variations in Southeast Germany by comparing their outputs to both dialectal and standardized transcriptions.

AI-generated summary

Although Germany has a diverse landscape of dialects, they are underrepresented in current automatic speech recognition (ASR) research. To enable studies of how robust models are towards dialectal variation, we present Betthupferl, an evaluation dataset containing four hours of read speech in three dialect groups spoken in Southeast Germany (Franconian, Bavarian, Alemannic), and half an hour of Standard German speech. We provide both dialectal and Standard German transcriptions, and analyze the linguistic differences between them. We benchmark several multilingual state-of-the-art ASR models on speech translation into Standard German, and find differences between how much the output resembles the dialectal vs. standardized transcriptions. Qualitative error analyses of the best ASR model reveal that it sometimes normalizes grammatical differences, but often stays closer to the dialectal constructions.

Community

Repo on GitHub is here: https://github.com/mainlp/betthupferl

@verenablaschke @mi-winkler @constantinSch @bplank Many thanks for releasing this great resource!

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2506.02894 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2506.02894 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2506.02894 in a Space README.md to link it from this page.

Collections including this paper 1