An Open Dataset and Model for Language Identification Paper • 2305.13820 • Published May 23, 2023
The University of Edinburgh's Submission to the WMT22 Code-Mixing Shared Task (MixMT) Paper • 2210.11309 • Published Oct 20, 2022
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Paper • 2503.10267 • Published Mar 13, 2025
speakleash/Bielik-7B-Instruct-v0.1 Text Generation • 7B • Updated Oct 26, 2024 • 2.18k • 58