EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models Paper • 2409.17892 • Published Sep 26, 2024
GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models Paper • 2504.04155 • Published Apr 5, 2025
Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources Paper • 2504.04152 • Published Apr 5, 2025
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data Paper • 2506.00469 • Published May 31, 2025
A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives Paper • 2407.15489 • Published Jul 22, 2024
Scaling Low-Resource MT via Synthetic Data Generation with LLMs Paper • 2505.14423 • Published May 20, 2025
DeltaProduct: Improving State-Tracking in Linear RNNs via Householder Products Paper • 2502.10297 • Published Feb 14, 2025
Got Compute, but No Data: Lessons From Post-training a Finnish LLM Paper • 2503.09407 • Published Mar 12, 2025
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Paper • 2503.10267 • Published Mar 13, 2025
ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning Paper • 2409.18827 • Published Sep 27, 2024
GPT-SW3: An Autoregressive Language Model for the Nordic Languages Paper • 2305.12987 • Published May 22, 2023
A New Massive Multilingual Dataset for High-Performance Language Technologies Paper • 2403.14009 • Published Mar 20, 2024
Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging Paper • 2304.04726 • Published Apr 10, 2023
Sentence Embeddings in NLI with Iterative Refinement Encoders Paper • 1808.08762 • Published Aug 27, 2018