WizardLM-13B-v1.2-Sharded-8GB / README.md

alim

Create README.md

52250d1 over 1 year ago

4.88 kB

	---
	license: llama2
	pipeline_tag: text-generation
	---
	# Disclaimer: I do not own the weights of WizardLM-13B-V1.2, nor did I train the model. I only sharded or split the model weights.

	The actual weights can be found [here](https://huggingface.co/WizardLM/WizardLM-13B-V1.2).

	The rest of the README is copied from the same page listed above.



	This is the Full-Weight of WizardLM-13B V1.2 model, this model is trained from Llama-2 13b.

	## WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions



	<p align="center">
	🤗 <a href="https://huggingface.co/WizardLM" target="_blank">HF Repo</a> • 🐦 <a href="https://twitter.com/WizardLM_AI" target="_blank">Twitter</a> • 📃 <a href="https://arxiv.org/abs/2304.12244" target="_blank">[WizardLM]</a> • 📃 <a href="https://arxiv.org/abs/2306.08568" target="_blank">[WizardCoder]</a> <br>
	</p>
	<p align="center">
	👋 Join our <a href="https://discord.gg/bpmeZD7V" target="_blank">Discord</a>
	</p>


	<font size=4>

	\| <sup>Model</sup> \| <sup>Checkpoint</sup> \| <sup>Paper</sup> \|<sup>MT-Bench</sup> \| <sup>AlpacaEval</sup> \| <sup>WizardEval</sup> \| <sup>HumanEval</sup> \| <sup>License</sup>\|
	\| ----- \|------\| ---- \|------\|-------\| ----- \| ----- \| ----- \|
	\| <sup>WizardLM-13B-V1.2</sup> \| <sup>🤗 <a href="https://huggingface.co/WizardLM/WizardLM-13B-V1.2" target="_blank">HF Link</a> </sup>\| \| <sup>7.06</sup> \| <sup>89.17%</sup> \| <sup>101.4% </sup>\|<sup>36.6 pass@1</sup>\|<sup> <a href="https://ai.meta.com/resources/models-and-libraries/llama-downloads/" target="_blank">Llama 2 License </a></sup> \|
	\| <sup>WizardLM-13B-V1.1</sup> \|<sup> 🤗 <a href="https://huggingface.co/WizardLM/WizardLM-13B-V1.1" target="_blank">HF Link</a> </sup> \| \| <sup>6.76</sup> \|<sup>86.32%</sup> \| <sup>99.3% </sup> \|<sup>25.0 pass@1</sup>\| <sup>Non-commercial</sup>\|
	\| <sup>WizardLM-30B-V1.0</sup> \| <sup>🤗 <a href="https://huggingface.co/WizardLM/WizardLM-30B-V1.0" target="_blank">HF Link</a></sup> \| \| <sup>7.01</sup> \| \| <sup>97.8% </sup> \| <sup>37.8 pass@1</sup>\| <sup>Non-commercial</sup> \|
	\| <sup>WizardLM-13B-V1.0</sup> \| <sup>🤗 <a href="https://huggingface.co/WizardLM/WizardLM-13B-V1.0" target="_blank">HF Link</a> </sup> \| \| <sup>6.35</sup> \| <sup>75.31%</sup> \| <sup>89.1% </sup> \|<sup> 24.0 pass@1 </sup> \| <sup>Non-commercial</sup>\|
	\| <sup>WizardLM-7B-V1.0 </sup>\| <sup>🤗 <a href="https://huggingface.co/WizardLM/WizardLM-7B-V1.0" target="_blank">HF Link</a> </sup> \|<sup> 📃 <a href="https://arxiv.org/abs/2304.12244" target="_blank">[WizardLM]</a> </sup>\| \| \| <sup>78.0% </sup> \|<sup>19.1 pass@1 </sup>\|<sup> Non-commercial</sup>\|
	\| <sup>WizardCoder-15B-V1.0</sup> \| <sup> 🤗 <a href="https://huggingface.co/WizardLM/WizardCoder-15B-V1.0" target="_blank">HF Link</a></sup> \| <sup>📃 <a href="https://arxiv.org/abs/2306.08568" target="_blank">[WizardCoder]</a></sup> \| \|\| \|<sup> 57.3 pass@1 </sup> \| <sup> <a href="https://huggingface.co/spaces/bigcode/bigcode-model-license-agreement" target="_blank">OpenRAIL-M</a></sup> \|
	</font>

	Repository: https://github.com/nlpxucan/WizardLM

	Twitter:


	- 🔥🔥🔥 [7/25/2023] We released WizardLM V1.2 models. The WizardLM-13B-V1.2 is here ([Demo_13B-V1.2](https://b7a19878988c8c73.gradio.app), [Demo_13B-V1.2_bak-1](https://d0a37a76e0ac4b52.gradio.app/), [Full Model Weight](https://huggingface.co/WizardLM/WizardLM-13B-V1.2)). Please checkout the [paper](https://arxiv.org/abs/2304.12244).
	- 🔥🔥🔥 [7/25/2023] The WizardLM-13B-V1.2 achieves 7.06 on [MT-Bench Leaderboard](https://chat.lmsys.org/?leaderboard), 89.17% on [AlpacaEval Leaderboard](https://tatsu-lab.github.io/alpaca_eval/), and 101.4% on [WizardLM Eval](https://github.com/nlpxucan/WizardLM/blob/main/WizardLM/data/WizardLM_testset.jsonl). (Note: MT-Bench and AlpacaEval are all self-test, will push update and request review. All tests are completed under their official settings.)

	❗<b>Note for model system prompts usage:</b>


	<b>WizardLM</b> adopts the prompt format from <b>Vicuna</b> and supports multi-turn conversation. The prompt should be as following:

	```
	A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions.
	USER: Hi
	ASSISTANT: Hello.
	USER: Who are you?
	ASSISTANT: I am WizardLM.
	......
	```

	❗<b>To commen concern about dataset:</b>

	Recently, there have been clear changes in the open-source policy and regulations of our overall organization's code, data, and models.


	Despite this, we have still worked hard to obtain opening the weights of the model first, but the data involves stricter auditing and is in review with our legal team .

	Our researchers have no authority to publicly release them without authorization.

	Thank you for your understanding.