Update README.md (#1)
- Update README.md (90eced2ccabe9cf8bbe294f5f2d50f2d97e111b8)
Co-authored-by: Pengzhi Gao
README.md
CHANGED
@@ -46,13 +46,7 @@ GemmaX2-28-9B-Pretrain is a language model developed through continual pretraini
 - **Model type:** GemmaX2-28-9B-Pretrain is obtained by continually pretraining Gemma2-9B on a large amount of monolingual and parallel data. Subsequently, GemmaX2-28-9B-v0.1 is derived through supervised finetuning on a small set of high-quality translation instruction data.
 - **Languages:** Arabic, Bengali, Czech, German, English, Spanish, Persian, French, Hebrew, Hindi, Indonesian, Italian, Japanese, Khmer, Korean, Lao, Malay, Burmese, Dutch, polish, Portuguese, Russian, Thai, Tagalog, Turkish, Urdu, Vietnamese, Chinese.
 
-
-
-- paper: [Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study](https://arxiv.org/pdf/2502.02481)
-
-## Model Performance
-
-![Experimental Result](https://cdn-uploads.huggingface.co/production/uploads/63f6e0a54f7e8c8a8b2f0c9f/6Z8Qn4Q8Q8Q8Q8Q8Q8Q8Q.png)
+**Note that GemmaX2-28-9B-Pretrain is NOT translation model.**
 
 
 ## Training Data