M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis Paper • 2502.11824 • Published 12 days ago
Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on Manchu Paper • 2502.11862 • Published 12 days ago
NoLiMa: Long-Context Evaluation Beyond Literal Matching Paper • 2502.05167 • Published 21 days ago • 15
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages Paper • 2410.23825 • Published Oct 31, 2024 • 4
LangSAMP: Language-Script Aware Multilingual Pretraining Paper • 2409.18199 • Published Sep 26, 2024 • 1
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment Paper • 2410.05873 • Published Oct 8, 2024 • 3