hiyouga/Qwen-14B-Chat-LLaMAfied

hiyouga committed on Dec 25, 2023

Commit 064c439 · 1 Parent(s): e9354b6

Update README.md

Files changed (1)

README.md +43 -0

@@ -3,4 +3,47 @@ license: other
 license_name: qwen
 license_link: >-
   https://github.com/QwenLM/Qwen/blob/main/Tongyi%20Qianwen%20LICENSE%20AGREEMENT
+language:
+- en
+- zh
+library_name: transformers
+pipeline_tag: text-generation
+tags:
+- llama
+- qwen
 ---
+This is the LLaMAfied version of the [Qwen-14B-Chat](https://huggingface.co/Qwen/Qwen-14B-Chat) model by Alibaba Cloud.
+
+This model was converted with https://github.com/hiyouga/LLaMA-Factory/blob/main/tests/llamafy_qwen.py
+
+You may fine-tune this model for downstream tasks; we recommend our efficient fine-tuning toolkit, [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory).
+
+- **Developed by:** Alibaba Cloud.
+- **Language(s) (NLP):** Chinese/English
+- **License:** [Tongyi Qianwen License](https://github.com/QwenLM/Qwen/blob/main/Tongyi%20Qianwen%20LICENSE%20AGREEMENT)
+
+Usage:
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer
+tokenizer = AutoTokenizer.from_pretrained("hiyouga/Qwen-14B-Chat-LLaMAfied")
+# Load the weights and move the whole model to the GPU.
+model = AutoModelForCausalLM.from_pretrained("hiyouga/Qwen-14B-Chat-LLaMAfied").cuda()
+# Print decoded text as it is generated, without echoing the prompt.
+streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)
+# Prompt in the <|im_start|>/<|im_end|> chat format, ending with an open assistant turn.
+query = (
+    "<|im_start|>system\nYou are a helpful assistant.<|im_end|>\n"
+    "<|im_start|>user\nWho are you?<|im_end|>\n"
+    "<|im_start|>assistant\n"
+)
+inputs = tokenizer([query], return_tensors="pt")
+inputs = inputs.to("cuda")
+# Generation stops when either of the listed EOS token IDs is produced.
+generate_ids = model.generate(**inputs, eos_token_id=[151643, 151645], max_new_tokens=256, streamer=streamer)
+```
+Alternatively, you can launch a CLI demo with the script in [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory):
+```bash
+python src/cli_demo.py --template qwen --model_name_or_path hiyouga/Qwen-14B-Chat-LLaMAfied
+```
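
The usage snippet in this README writes the chat prompt by hand. As a minimal sketch, the same string could be assembled from a list of messages. `build_chatml_prompt`, its `add_generation_prompt` parameter, and the message dictionary layout are hypothetical names chosen for illustration; they are not part of this commit, of `transformers`, or of LLaMA-Factory.

```python
def build_chatml_prompt(messages, add_generation_prompt=True):
    """Join role/content messages into the <|im_start|>/<|im_end|> format shown in the README."""
    parts = []
    for message in messages:
        # Each turn is wrapped as: <|im_start|>{role}\n{content}<|im_end|>\n
        parts.append(f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n")
    if add_generation_prompt:
        # Leave an open assistant turn so the model continues from here.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Who are you?"},
]
query = build_chatml_prompt(messages)
```

For the two messages above, `query` is the same string as the hand-written prompt in the usage snippet, so it could be passed to the tokenizer in its place.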