GPT2 model for German Leichte Sprache (Easy language)
A German Leichte Sprache (Easy language) model based on mGPT.
See our code here: https://github.com/MiriUll/Language-Models-German-Simplification
See our paper here: Language Models for German Text Simplification: Overcoming Parallel Data Scarcity through Style-specific Pre-training
Dataset
This model was fine-tuned on a collection of monolingual Leichte Sprache data. This corpus can be recreated here.
Citation
If you use this model, please cite our paper:
@inproceedings{anschutz-etal-2023-language,
β  title = "Language Models for {G}erman Text Simplification: Overcoming Parallel Data Scarcity through Style-specific Pre-training",
β  author = {Ansch{"u}tz, Miriam  and Oehms, Joshua  and Wimmer, Thomas  and Jezierski, Bart{\l}omiej  and Groh, Georg},
β  booktitle = "Findings of the Association for Computational Linguistics: ACL 2023",
β  month = jul,
β  year = "2023",
β  address = "Toronto, Canada",
β  publisher = "Association for Computational Linguistics",
β  url = "https://aclanthology.org/2023.findings-acl.74",
β  pages = "1147--1158",
}
- Downloads last month
 - 2