atsuki-yamaguchi commited on
Commit
56794e8
·
verified ·
1 Parent(s): 461a592

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +62 -0
README.md ADDED
@@ -0,0 +1,62 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ license: apache-2.0
4
+ datasets:
5
+ - allenai/MADLAD-400
6
+ language:
7
+ - ta
8
+ base_model:
9
+ - Qwen/Qwen2.5-7B-Instruct
10
+ library_name: transformers
11
+ ---
12
+ # Qwen2.5 7B Instruct for Tamil: Continual pre-training only
13
+
14
+ This model is built on top of Qwen2.5 7B Instruct adapted for Tamil using 500M target language tokens sampled from MADLAD-400.
15
+
16
+ ## Model Details
17
+
18
+ * **Vocabulary**: This model has no additional target vocabulary. It retains the original vocabulary of Qwen2.5 7B Instruct.
19
+ * **Training**: This model was continually pre-trained on 500M target language tokens sampled from MADLAD-400.
20
+
21
+
22
+ ## Model Description
23
+
24
+ - **Language:** Tamil
25
+ - **License:** Apache 2.0
26
+ - **Fine-tuned from model:** Qwen/Qwen2.5-7B-Instruct
27
+
28
+
29
+ ## Model Sources
30
+
31
+ - **Repository:** https://github.com/gucci-j/chat-cve
32
+ - **Paper:** https://arxiv.org/abs/2412.11704
33
+
34
+
35
+ ## How to Get Started with the Model
36
+ Use the code below to get started with the model.
37
+ ```python
38
+ from transformers import AutoTokenizer, AutoModelForCausalLM
39
+
40
+ model = AutoModelForCausalLM.from_pretrained(
41
+ "atsuki-yamaguchi/Qwen2.5-7B-Instruct-ta-lapt-madlad"
42
+ )
43
+ tokenizer = AutoTokenizer.from_pretrained(
44
+ "Qwen/Qwen2.5-7B-Instruct"
45
+ )
46
+ ```
47
+
48
+
49
+ ## Citation
50
+ ```
51
+ @misc{yamaguchi2024vocabularyexpansionchatmodels,
52
+ title={{ElChat}: Adapting Chat Language Models Using Only Target Unlabeled Language Data},
53
+ author={Atsuki Yamaguchi and Terufumi Morishita and Aline Villavicencio and Nikolaos Aletras},
54
+ year={2024},
55
+ eprint={2412.11704},
56
+ archivePrefix={arXiv},
57
+ primaryClass={cs.CL},
58
+ url={https://arxiv.org/abs/2412.11704},
59
+ }
60
+ ```
61
+
62
+