TheBloke committed
Commit b797338
1 parent: a711f0b

Update README.md

Files changed (1):
  1. README.md +29 -14
README.md CHANGED
@@ -3,17 +3,19 @@ inference: false
 license: other
 ---
 
+<!-- header start -->
 <div style="width: 100%;">
 <img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
 </div>
 <div style="display: flex; justify-content: space-between; width: 100%;">
 <div style="display: flex; flex-direction: column; align-items: flex-start;">
-<p><a href="https://discord.gg/UBgz4VXf">Chat & support: my new Discord server</a></p>
+<p><a href="https://discord.gg/Jq4vkcDakD">Chat & support: my new Discord server</a></p>
 </div>
 <div style="display: flex; flex-direction: column; align-items: flex-end;">
-<p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? Patreon coming soon!</a></p>
+<p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? TheBloke's Patreon page</a></p>
 </div>
 </div>
+<!-- header end -->
 
 # WizardLM 13B 1.0 fp16
 
@@ -39,18 +41,31 @@ USER: prompt goes here
 ASSISTANT:
 ```
 
-## Want to support my work?
-
-I've had a lot of people ask if they can contribute. I love providing models and helping people, but it is starting to rack up pretty big cloud computing bills.
-
-So if you're able and willing to contribute, it'd be most gratefully received and will help me to keep providing models, and work on various AI projects.
-
-Donaters will get priority support on any and all AI/LLM/model questions, and I'll gladly quantise any model you'd like to try.
-
-* Patreon: coming soon! (just awaiting approval)
+<!-- footer start -->
+## Discord
+
+For further support, and discussions on these models and AI in general, join us at:
+
+[TheBloke AI's Discord server](https://discord.gg/Jq4vkcDakD)
+
+## Thanks, and how to contribute.
+
+Thanks to the [chirper.ai](https://chirper.ai) team!
+
+I've had a lot of people ask if they can contribute. I enjoy providing models and helping people, and would love to be able to spend even more time doing it, as well as expanding into new projects like fine tuning/training.
+
+If you're able and willing to contribute it will be most gratefully received and will help me to keep providing more models, and to start work on new AI projects.
+
+Donaters will get priority support on any and all AI/LLM/model questions and requests, access to a private Discord room, plus other benefits.
+
+* Patreon: https://patreon.com/TheBlokeAI
 * Ko-Fi: https://ko-fi.com/TheBlokeAI
-* Discord: https://discord.gg/UBgz4VXf
-
+
+**Patreon special mentions**: Aemon Algiz, Dmitriy Samsonov, Nathan LeClaire, Trenton Dambrowitz, Mano Prime, David Flickinger, vamX, Nikolai Manek, senxiiz, Khalefa Al-Ahmad, Illia Dulskyi, Jonathan Leane, Talal Aujan, V. Lukas, Joseph William Delisle, Pyrater, Oscar Rangel, Lone Striker, Luke Pendergrass, Eugene Pentland, Sebastain Graf, Johann-Peter Hartman.
+
+Thank you to all my generous patrons and donaters!
+<!-- footer end -->
+
 # Original model card
 
 ## WizardLM: An Instruction-following LLM Using Evol-Instruct
@@ -80,7 +95,7 @@ At present, our core contributors are preparing the **33B** version and we expec
 
 ### GPT-4 automatic evaluation
 
-We adopt the automatic evaluation framework based on GPT-4 proposed by FastChat to assess the performance of chatbot models. As shown in the following figure, WizardLM-13B achieved better results than Vicuna-13b.
+We adopt the automatic evaluation framework based on GPT-4 proposed by FastChat to assess the performance of chatbot models. As shown in the following figure, WizardLM-13B achieved better results than Vicuna-13b.
 <p align="center" width="100%">
 <a ><img src="imgs/WizarLM13b-GPT4.png" alt="WizardLM" style="width: 100%; min-width: 300px; display: block; margin: auto;"></a>
 </p>
@@ -161,7 +176,7 @@ This JSON file is a list of dictionaries, each dictionary contains the following
 We release [WizardLM] weights as delta weights to comply with the LLaMA model license.
 You can add our delta to the original LLaMA weights to obtain the WizardLM weights. Instructions:
 1. Get the original LLaMA weights in the huggingface format by following the instructions [here](https://huggingface.co/docs/transformers/main/model_doc/llama).
-2. Please download our delta model at the following [link](https://huggingface.co/victor123/WizardLM)
+2. Please download our delta model at the following [link](https://huggingface.co/victor123/WizardLM)
 3. Use the following scripts to get WizardLM weights by applying our delta:
 ```
 python src/weight_diff_wizard.py recover --path_raw <path_to_step_1_dir> --path_diff <path_to_step_2_dir> --path_tuned <path_to_store_recovered_weights>
@@ -224,9 +239,9 @@ python src\inference_wizardlm.py
 
 ### Evaluation
 
-To evaluate Wizard, we conduct human evaluation on the inputs from our human instruct evaluation set [`WizardLM_testset.jsonl`](./data/WizardLM_testset.jsonl). This evaluation set was collected by the authors and covers a diverse list of user-oriented instructions including difficult Coding Generation & Debugging, Math, Reasoning, Complex Formats, Academic Writing, Extensive Disciplines, and so on. We performed a blind pairwise comparison between Wizard and baselines. Specifically, we recruit 10 well-educated annotators to rank the models from 1 to 5 on relevance, knowledgeable, reasoning, calculation and accuracy.
+To evaluate Wizard, we conduct human evaluation on the inputs from our human instruct evaluation set [`WizardLM_testset.jsonl`](./data/WizardLM_testset.jsonl). This evaluation set was collected by the authors and covers a diverse list of user-oriented instructions including difficult Coding Generation & Debugging, Math, Reasoning, Complex Formats, Academic Writing, Extensive Disciplines, and so on. We performed a blind pairwise comparison between Wizard and baselines. Specifically, we recruit 10 well-educated annotators to rank the models from 1 to 5 on relevance, knowledgeable, reasoning, calculation and accuracy.
 
-WizardLM achieved significantly better results than Alpaca and Vicuna-7b.
+WizardLM achieved significantly better results than Alpaca and Vicuna-7b.
 <p align="center" width="60%">
 <a ><img src="imgs/win.png" alt="WizardLM" style="width: 60%; min-width: 300px; display: block; margin: auto;"></a>
 </p>
@@ -242,7 +257,7 @@ Please cite the repo if you use the data or code in this repo.
 
 ```
 @misc{xu2023wizardlm,
-      title={WizardLM: Empowering Large Language Models to Follow Complex Instructions},
+      title={WizardLM: Empowering Large Language Models to Follow Complex Instructions},
       author={Can Xu and Qingfeng Sun and Kai Zheng and Xiubo Geng and Pu Zhao and Jiazhan Feng and Chongyang Tao and Daxin Jiang},
       year={2023},
       eprint={2304.12244},
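The delta-weight recovery step in the README (the `weight_diff_wizard.py recover` command) is, at its core, elementwise addition of a delta checkpoint onto the base LLaMA checkpoint. A minimal sketch of that idea, using plain Python lists as stand-ins for real weight tensors — the function name and toy values below are illustrative, not part of the actual script, which operates on full PyTorch state dicts:

```python
# Sketch of delta-weight recovery: tuned[name] = base[name] + delta[name]
# for every tensor in the checkpoint. Toy data; real checkpoints hold
# torch tensors keyed by parameter name.

def recover_weights(base: dict, delta: dict) -> dict:
    """Add each delta tensor to the matching base tensor."""
    if base.keys() != delta.keys():
        raise ValueError("base and delta checkpoints must have the same tensors")
    return {
        name: [b + d for b, d in zip(base[name], delta[name])]
        for name in base
    }

base = {"layer0.weight": [0.5, -0.25, 1.0]}
delta = {"layer0.weight": [0.25, 0.25, -0.5]}
tuned = recover_weights(base, delta)
print(tuned["layer0.weight"])  # → [0.75, 0.0, 0.5]
```

Distributing only the delta (and requiring users to supply the base weights themselves) is what lets WizardLM comply with the LLaMA license while still making the fine-tuned weights reproducible.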