MaLA Corpus for Massive Language Adaptation of Large Language Models https://mala-lm.github.io
MaLA-LM
community
AI & ML interests
NLP & LLM
Recent Activity
View all activity
MaLA-500: Massive Language Adaptation of Large Language Models https://mala-lm.github.io
Benchmarks in many languages
Enhancing massively multilingual adaptation of LLMs on 500+ languages https://mala-lm.github.io
-
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data
Paper • 2506.00469 • Published • 2 -
MaLA-LM/emma-500-llama3-8b-mono
Text Generation • 8B • Updated • 31 -
MaLA-LM/emma-500-llama3-8b-bi
Text Generation • 8B • Updated • 53 -
MaLA-LM/emma-500-llama3.1-8b-mono
Text Generation • 8B • Updated • 39
Ji, S., & Chen, P. (2025). How Many Languages Make Good Multilingual Instruction Tuning? A Case Study on BLOOM. In Proceedings of COLING 2025.
MaLA Corpus for Massive Language Adaptation of Large Language Models https://mala-lm.github.io
Enhancing massively multilingual adaptation of LLMs on 500+ languages https://mala-lm.github.io
-
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data
Paper • 2506.00469 • Published • 2 -
MaLA-LM/emma-500-llama3-8b-mono
Text Generation • 8B • Updated • 31 -
MaLA-LM/emma-500-llama3-8b-bi
Text Generation • 8B • Updated • 53 -
MaLA-LM/emma-500-llama3.1-8b-mono
Text Generation • 8B • Updated • 39
MaLA-500: Massive Language Adaptation of Large Language Models https://mala-lm.github.io
Ji, S., & Chen, P. (2025). How Many Languages Make Good Multilingual Instruction Tuning? A Case Study on BLOOM. In Proceedings of COLING 2025.
Benchmarks in many languages