Update README.md
README.md
CHANGED
@@ -17,7 +17,7 @@ library_name: transformers
 **EMMA-500 Llama 3 8B** is a state-of-the-art multilingual language model designed to improve language representation, especially in low-resource languages, through continual pre-training on the **Llama 3 8B** architecture. Leveraging the **[MaLA Corpus](https://huggingface.co/collections/MaLA-LM/mala-corpus-66e05127641a51de34d39529)**, which spans over 500 languages and is augmented with books, code, instruction data, and papers, EMMA-500 excels in multilingual tasks like commonsense reasoning, machine translation, and text classification.
 
 - Project Website: https://mala-lm.github.io/emma-500-gen2.html
-- Paper:
+- Paper: https://arxiv.org/abs/2506.00469
 
 ---
 