Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis Paper • 2409.20059 • Published Sep 30, 2024 • 16
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation Paper • 2406.16678 • Published Jun 24, 2024 • 16
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation Paper • 2305.18893 • Published May 30, 2023 • 2
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models Paper • 2305.14214 • Published May 23, 2023
HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis Response Paper • 2210.04573 • Published Oct 10, 2022
Tower: An Open Multilingual Large Language Model for Translation-Related Tasks Paper • 2402.17733 • Published Feb 27, 2024 • 5
CroissantLLM: A Truly Bilingual French-English Language Model Paper • 2402.00786 • Published Feb 1, 2024 • 26
WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models Paper • 2112.06598 • Published Dec 13, 2021 • 1