abhinand committed on
Commit
255d26f
1 Parent(s): 373c83c

Update README.md

  1. README.md +14 -3
README.md CHANGED
@@ -9,7 +9,7 @@ license: gpl-3.0
 
  Welcome to the inaugural release of the Tamil LLaMA 7B instruct model – an important step in advancing LLMs for the Tamil language. This model is ready for immediate inference and is also primed for further fine-tuning to cater to your specific NLP tasks.
 
- To dive deep into the development and capabilities of this model, please read the [research paper](https://arxiv.org/abs/) and the [introductory blog post](https://intro.com) that outlines our journey and the model's potential impact.
+ To dive deep into the development and capabilities of this model, please read the [research paper](https://arxiv.org/abs/2311.05845) and the [introductory blog post (WIP)]() that outlines our journey and the model's potential impact.
 
  ## Model description
 
@@ -20,6 +20,7 @@ The Tamil LLaMA models have been enhanced and tailored specifically with an exte
  - **License:** GNU General Public License v3.0
  - **Finetuned from model:** [abhinand/tamil-llama-7b-base-v0.1](https://huggingface.co/abhinand/tamil-llama-7b-base-v0.1)
  - **Training Precision:** `float16`
+ - **Code:** [GitHub](https://github.com/abhinand5/tamil-llama)
 
  ## Prompting Format
 
@@ -66,7 +67,17 @@ Get to know the creators behind this innovative model and follow their contribut
 
  ## Citation
 
- *Citation info will be added here once the paper is published*
-
+ If you use this model or any of the Tamil-Llama datasets in your research, please cite:
+
+ ```bibtex
+ @misc{balachandran2023tamilllama,
+ title={Tamil-Llama: A New Tamil Language Model Based on Llama 2},
+ author={Abhinand Balachandran},
+ year={2023},
+ eprint={2311.05845},
+ archivePrefix={arXiv},
+ primaryClass={cs.CL}
+ }
+ ```
 
  We hope this model serves as a valuable tool in your NLP toolkit and look forward to seeing the advancements it will enable in the understanding and generation of the Tamil language.