LinTO Audio and Textual Datasets to Train and Evaluate Automatic Speech Recognition in Tunisian Arabic Dialect Paper • 2504.02604 • Published Apr 3 • 1
The Lucie-7B LLM and the Lucie Training Dataset: Open resources for multilingual language generation Paper • 2503.12294 • Published Mar 15 • 1
The Lucie-7B LLM and the Lucie Training Dataset: Open resources for multilingual language generation Paper • 2503.12294 • Published Mar 15 • 1
TIAM -- A Metric for Evaluating Alignment in Text-to-Image Generation Paper • 2307.05134 • Published Jul 11, 2023 • 3
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters Paper • 2010.10392 • Published Oct 20, 2020 • 1