qwp4w3hyb committed on May 13

Commit 31f6a88 • 1 Parent(s): 7be51cf

Upload folder using huggingface_hub

Files changed (26)

.gitattributes +20 -0
README.md +116 -0
config.json +28 -0
generation_config.json +7 -0
imat-bf16-gmerged.dat +3 -0
special_tokens_map.json +30 -0
tokenizer_config.json +53 -0
yi-1.5-9b-chat-bf16.gguf +3 -0
yi-1.5-9b-chat-imat-IQ1_S.gguf +3 -0
yi-1.5-9b-chat-imat-IQ2_M.gguf +3 -0
yi-1.5-9b-chat-imat-IQ2_S.gguf +3 -0
yi-1.5-9b-chat-imat-IQ2_XS.gguf +3 -0
yi-1.5-9b-chat-imat-IQ2_XXS.gguf +3 -0
yi-1.5-9b-chat-imat-IQ3_M.gguf +3 -0
yi-1.5-9b-chat-imat-IQ3_S.gguf +3 -0
yi-1.5-9b-chat-imat-IQ3_XS.gguf +3 -0
yi-1.5-9b-chat-imat-IQ3_XXS.gguf +3 -0
yi-1.5-9b-chat-imat-IQ4_NL.gguf +3 -0
yi-1.5-9b-chat-imat-IQ4_XS.gguf +3 -0
yi-1.5-9b-chat-imat-Q4_0.gguf +3 -0
yi-1.5-9b-chat-imat-Q4_K_M.gguf +3 -0
yi-1.5-9b-chat-imat-Q4_K_S.gguf +3 -0
yi-1.5-9b-chat-imat-Q5_K_M.gguf +3 -0
yi-1.5-9b-chat-imat-Q5_K_S.gguf +3 -0
yi-1.5-9b-chat-imat-Q6_K.gguf +3 -0
yi-1.5-9b-chat-imat-Q8_0.gguf +3 -0

.gitattributes CHANGED

@@ -33,3 +33,23 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+imat-bf16-gmerged.dat filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-bf16.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+yi-1.5-9b-chat-imat-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text

README.md ADDED

	@@ -0,0 +1,116 @@

+---
+license: apache-2.0
+pipeline_tag: text-generation
+base_model: 01-ai/Yi-1.5-9B-Chat
+tags:
+- yi
+- 01-ai
+- instruct
+- finetune
+- chatml
+- gguf
+- imatrix
+- importance matrix
+model-index:
+- name: 01-ai/Yi-1.5-9B-Chat-iMat-GGUF
+  results: []
+---
+# Quant Infos
+- Quants made with an importance matrix to reduce quantization loss
+- GGUF & imatrix generated from bf16 for "minimal" accuracy loss (some say this is snake oil, but it can't hurt)
+- Wide coverage of different GGUF quant types from Q8\_0 down to IQ1\_S
+- Quantized with [llama.cpp](https://github.com/ggerganov/llama.cpp) commit [dc685be46622a8fabfd57cfa804237c8f15679b8](https://github.com/ggerganov/llama.cpp/commit/dc685be46622a8fabfd57cfa804237c8f15679b8) (master as of 2024-05-12)
+- Imatrix generated with [this](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384) multi-purpose dataset.
+  ```
+  ./imatrix -c 512 -m $model_name-bf16.gguf -f $llama_cpp_path/groups_merged.txt -o $out_path/imat-bf16-gmerged.dat
+  ```
+# Original Model Card:
+<div align="center">
+<picture>
+  <img src="https://raw.githubusercontent.com/01-ai/Yi/main/assets/img/Yi_logo_icon_light.svg" width="150px">
+</picture>
+</div>
+<p align="center">
+  <a href="https://github.com/01-ai">🐙 GitHub</a> •
+  <a href="https://discord.gg/hYUwWddeAu">👾 Discord</a> •
+  <a href="https://twitter.com/01ai_yi">🐤 Twitter</a> •
+  <a href="https://github.com/01-ai/Yi-1.5/issues/2">💬 WeChat</a>
+  <br/>
+  <a href="https://arxiv.org/abs/2403.04652">📝 Paper</a> •
+  <a href="https://github.com/01-ai/Yi/tree/main?tab=readme-ov-file#faq">🙌 FAQ</a> •
+  <a href="https://github.com/01-ai/Yi/tree/main?tab=readme-ov-file#learning-hub">📗 Learning Hub</a>
+</p>
+# Intro
+Yi-1.5 is an upgraded version of Yi. It is continually pre-trained from Yi on a high-quality corpus of 500B tokens and fine-tuned on 3M diverse fine-tuning samples.
+Compared with Yi, Yi-1.5 delivers stronger performance in coding, math, reasoning, and instruction-following capability, while still maintaining excellent capabilities in language understanding, commonsense reasoning, and reading comprehension.
+<div align="center">
+| Model | Context Length | Pre-trained Tokens |
+| :------------: | :------------: | :------------: |
+| Yi-1.5 | 4K | 3.6T |
+</div>
+# Models
+- Chat models
+  <div align="center">
+  | Name            | Download                                                                                                                                                            |
+  | --------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+  | Yi-1.5-34B-Chat | • [🤗 Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [🤖 ModelScope](https://www.modelscope.cn/organization/01ai) |
+  | Yi-1.5-9B-Chat  | • [🤗 Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [🤖 ModelScope](https://www.modelscope.cn/organization/01ai) |
+  | Yi-1.5-6B-Chat  | • [🤗 Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [🤖 ModelScope](https://www.modelscope.cn/organization/01ai) |
+  </div>
+- Base models
+  <div align="center">
+  | Name       | Download                                                                                                                                                            |
+  | ---------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+  | Yi-1.5-34B | • [🤗 Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [🤖 ModelScope](https://www.modelscope.cn/organization/01ai) |
+  | Yi-1.5-9B  | • [🤗 Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [🤖 ModelScope](https://www.modelscope.cn/organization/01ai) |
+  | Yi-1.5-6B  | • [🤗 Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [🤖 ModelScope](https://www.modelscope.cn/organization/01ai) |
+  </div>
+# Benchmarks
+- Chat models
+  Yi-1.5-34B-Chat is on par with or excels beyond larger models in most benchmarks.
+  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/656d9adce8bf55919aca7c3f/KcsJ9Oc1VnEmfCDEJc5cd.png)
+  Yi-1.5-9B-Chat is the top performer among similarly sized open-source models.
+  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/656d9adce8bf55919aca7c3f/xf6pLg5jqRCwjlh6m3t6_.png)
+- Base models
+  Yi-1.5-34B is on par with or excels beyond larger models in some benchmarks.
+  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/656d9adce8bf55919aca7c3f/BwU7QM-03dZvZzwdIE1xY.png)
+  Yi-1.5-9B is the top performer among similarly sized open-source models.
+  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/656d9adce8bf55919aca7c3f/y-EYSYPT-3aWLJ0x8R94F.png)
+# Quick Start
+For getting up and running with Yi-1.5 models quickly, see [README](https://github.com/01-ai/Yi-1.5).

config.json ADDED

	@@ -0,0 +1,28 @@

+{
+  "architectures": [
+    "LlamaForCausalLM"
+  ],
+  "attention_bias": false,
+  "attention_dropout": 0.0,
+  "bos_token_id": 1,
+  "eos_token_id": 2,
+  "hidden_act": "silu",
+  "hidden_size": 4096,
+  "initializer_range": 0.02,
+  "intermediate_size": 11008,
+  "max_position_embeddings": 4096,
+  "model_type": "llama",
+  "num_attention_heads": 32,
+  "num_hidden_layers": 48,
+  "num_key_value_heads": 4,
+  "pad_token_id": 0,
+  "pretraining_tp": 1,
+  "rms_norm_eps": 1e-06,
+  "rope_scaling": null,
+  "rope_theta": 5000000.0,
+  "tie_word_embeddings": false,
+  "torch_dtype": "bfloat16",
+  "transformers_version": "4.40.0",
+  "use_cache": false,
+  "vocab_size": 64000
+}
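
The fields in this config are enough for a rough size estimate. The sketch below is not part of the commit; it assumes the standard Llama layer layout (bias-free q/k/v/o projections, a gated MLP with three matrices, two RMSNorm weights per layer) and uses hypothetical helper names. It estimates the parameter count and the KV-cache size that grouped-query attention with 4 key/value heads implies.

```python
# Values copied from the config.json diff above.
config = {
    "hidden_size": 4096,
    "intermediate_size": 11008,
    "num_attention_heads": 32,
    "num_hidden_layers": 48,
    "num_key_value_heads": 4,
    "max_position_embeddings": 4096,
    "vocab_size": 64000,
    "tie_word_embeddings": False,
}


def estimate_parameters(cfg):
    """Rough parameter count, assuming a standard Llama block layout."""
    h = cfg["hidden_size"]
    head_dim = h // cfg["num_attention_heads"]
    kv_dim = head_dim * cfg["num_key_value_heads"]
    attn = 2 * h * h + 2 * h * kv_dim       # q_proj and o_proj, then k_proj and v_proj
    mlp = 3 * h * cfg["intermediate_size"]  # gate_proj, up_proj, down_proj
    norms = 2 * h                           # two RMSNorm weight vectors per layer
    per_layer = attn + mlp + norms
    embed = cfg["vocab_size"] * h
    lm_head = 0 if cfg["tie_word_embeddings"] else embed
    final_norm = h
    return cfg["num_hidden_layers"] * per_layer + embed + lm_head + final_norm


def kv_cache_bytes(cfg, n_tokens, bytes_per_value=2):
    """KV-cache size for n_tokens, assuming 16-bit cache entries by default."""
    head_dim = cfg["hidden_size"] // cfg["num_attention_heads"]
    per_token = 2 * cfg["num_hidden_layers"] * cfg["num_key_value_heads"] * head_dim
    return per_token * n_tokens * bytes_per_value


n_params = estimate_parameters(config)
cache = kv_cache_bytes(config, config["max_position_embeddings"])
print(f"estimated parameters: {n_params / 1e9:.2f}B")
print(f"KV cache at full context: {cache / 2**20:.0f} MiB")
```

Under these assumptions the estimate comes to about 8.83 billion parameters, which is consistent with the 17,661,112,576-byte bf16 GGUF in this commit (two bytes per weight), and to 384 MiB of 16-bit KV cache at the full 4096-token context.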

generation_config.json ADDED

	@@ -0,0 +1,7 @@

+{
+  "_from_model_config": true,
+  "bos_token_id": 1,
+  "eos_token_id": 2,
+  "pad_token_id": 0,
+  "transformers_version": "4.40.0"
+}

imat-bf16-gmerged.dat ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:01d42f1d802b1c4273653ea9f568093c649b0d772c58ac25ddc92fa6b7ac1aa6
+size 6843305
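
The three added lines above are a Git LFS pointer: the repository stores only a spec version, a content hash, and a byte size, while the actual file lives in LFS storage. Every `.gguf` entry later in this commit has the same shape. As a minimal sketch (the function name is hypothetical, and it handles only the three keys seen here), such a pointer can be parsed like this:

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the imat-bf16-gmerged.dat diff (without the "+" markers).
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:01d42f1d802b1c4273653ea9f568093c649b0d772c58ac25ddc92fa6b7ac1aa6
size 6843305
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```

After downloading the real file, comparing its SHA-256 digest and byte size against these values is a simple integrity check.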

special_tokens_map.json ADDED

	@@ -0,0 +1,30 @@

+{
+  "bos_token": {
+    "content": "<|startoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "eos_token": {
+    "content": "<|im_end|>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "pad_token": {
+    "content": "<unk>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "unk_token": {
+    "content": "<unk>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer_config.json ADDED

	@@ -0,0 +1,53 @@

+{
+  "add_bos_token": false,
+  "add_eos_token": false,
+  "add_prefix_space": true,
+  "added_tokens_decoder": {
+    "0": {
+      "content": "<unk>",
+      "lstrip": false,
+      "normalized": true,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "1": {
+      "content": "<|startoftext|>",
+      "lstrip": false,
+      "normalized": true,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "2": {
+      "content": "<|endoftext|>",
+      "lstrip": false,
+      "normalized": true,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "7": {
+      "content": "<|im_end|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "bos_token": "<|startoftext|>",
+  "chat_template": "{% if messages[0]['role'] == 'system' %}{% set system_message = messages[0]['content'] %}{% endif %}{% if system_message is defined %}{{ system_message }}{% endif %}{% for message in messages %}{% set content = message['content'] %}{% if message['role'] == 'user' %}{{ '<|im_start|>user\\n' + content + '<|im_end|>\\n<|im_start|>assistant\\n' }}{% elif message['role'] == 'assistant' %}{{ content + '<|im_end|>' + '\\n' }}{% endif %}{% endfor %}",
+  "clean_up_tokenization_spaces": false,
+  "eos_token": "<|im_end|>",
+  "legacy": true,
+  "model_max_length": 4096,
+  "pad_token": "<unk>",
+  "padding_side": "right",
+  "sp_model_kwargs": {},
+  "spaces_between_special_tokens": false,
+  "split_special_tokens": false,
+  "tokenizer_class": "LlamaTokenizer",
+  "unk_token": "<unk>",
+  "use_default_system_prompt": false
+}
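
The `chat_template` value above is a Jinja template, which is hard to read in its JSON-escaped form. The sketch below is a hand-written plain-Python mirror of its logic, not the tokenizer's own implementation; `render_chat` is a hypothetical name. It shows two details of the template: a leading system message is emitted as raw text without `<|im_start|>` tags, and every user turn automatically opens the next assistant turn.

```python
def render_chat(messages):
    """Mirror of the chat_template logic for a non-empty list of role/content dicts."""
    parts = []
    # A leading system message is written out verbatim, with no ChatML tags.
    if messages and messages[0]["role"] == "system":
        parts.append(messages[0]["content"])
    for message in messages:
        content = message["content"]
        if message["role"] == "user":
            # Each user turn also opens the assistant turn that follows it.
            parts.append(
                "<|im_start|>user\n" + content + "<|im_end|>\n<|im_start|>assistant\n"
            )
        elif message["role"] == "assistant":
            parts.append(content + "<|im_end|>" + "\n")
    return "".join(parts)


prompt = render_chat([
    {"role": "user", "content": "Hi"},
    {"role": "assistant", "content": "Hello!"},
    {"role": "user", "content": "Who are you?"},
])
print(prompt)
```

One difference from the original: this mirror returns an empty string for an empty message list, a case the Jinja template does not handle. Because the template already appends the assistant header after each user message, no separate generation-prompt step is needed when the conversation ends on a user turn.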

yi-1.5-9b-chat-bf16.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e52b0478e2c130787a83f115c5f87837924be91ccb5ea383977b4d9cc9fdb01f
+size 17661112576

yi-1.5-9b-chat-imat-IQ1_S.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:29249d91a2b62565e59adb36f1119b92cfdc8ba82fd42f968b177b69a7428b08
+size 2014573088

yi-1.5-9b-chat-imat-IQ2_M.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b3366619f7f2b3889772bc32c17e64a567a8efc1525606cc6e1275615dc50130
+size 3098112544

yi-1.5-9b-chat-imat-IQ2_S.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4a858702b2b3a38d64bb3f94121cd9cf415a6a3775a7b94c43a194deab3b10c6
+size 2875355680

yi-1.5-9b-chat-imat-IQ2_XS.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e2b8f1c48084e66b3439b48be855c1bb1318820e3a96b43d5728673d7bf83ecb
+size 2708009504

yi-1.5-9b-chat-imat-IQ2_XXS.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4b7c8e730d27a18a733d3f21a9bea79c0df7ceca9a9cfb9ccc7b460a697af191
+size 2460086816

yi-1.5-9b-chat-imat-IQ3_M.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:618cf53c956809a288c687b350ce704ca31a86f6bb9f636fd22e5cbe192b4bc2
+size 4055462432

yi-1.5-9b-chat-imat-IQ3_S.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fd104b004844eddf532c1f1cf0c8735773f92dcb96c82321a347f668fab19316
+size 3912577568

yi-1.5-9b-chat-imat-IQ3_XS.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f419f27d87e5ac07e0c3d0b1b1f628d60f63b5fb6e3489b17a199c3eec1aa7d0
+size 3717935648

yi-1.5-9b-chat-imat-IQ3_XXS.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cb0b612fa215f5dee01a2833e733582b5a26cef5ed90cb0699689ce095962694
+size 3474321952

yi-1.5-9b-chat-imat-IQ4_NL.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1af5b4f21cf8341c7c086a768816e82d3bd67dd1e2717bebfd4b973b1b20d785
+size 5049578016

yi-1.5-9b-chat-imat-IQ4_XS.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3c3966a4d7cee778aaf6f7308ced4512f6af922fcb3b5720e0f7bea5ac12e1b7
+size 4785009184

yi-1.5-9b-chat-imat-Q4_0.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2756c1cf961624a0dbc9b1716feb0d4cdfcde60f55d14e298f1fc16aaa5711ac
+size 5053903392

yi-1.5-9b-chat-imat-Q4_K_M.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6b7cc5223ae5a9b0b3ee0d6dd8e9caf2d4daf507eaa50cff4f692ea869bbcf6a
+size 5328957984

yi-1.5-9b-chat-imat-Q4_K_S.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:52b0395ab19b78ceae958ae7f9c6fff3087780cc4ef942f31c3937fb02cc0f80
+size 5071860256

yi-1.5-9b-chat-imat-Q5_K_M.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7684de17efcb3d502f1a6bffbf76f6bbb46ea6a2cf074c85863f999e43d037dd
+size 6258258464

yi-1.5-9b-chat-imat-Q5_K_S.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c943666efb6a4485af01101d59ef90228e3ecb51e00d62ce572298dbc2ba1007
+size 6107853344

yi-1.5-9b-chat-imat-Q6_K.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7e3b7c25b1589e47439ec5a3eed5760b473c0c9b997f620f8673d8b00d0ec1fe
+size 7245640224

yi-1.5-9b-chat-imat-Q8_0.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2b2cb4a5c53d51c95dde5fdbd15939f56537f61b8e8a283a1cde6a76539af1d7
+size 9383916064
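
The `size` fields in the LFS pointers give a quick way to compare the quants. The sketch below is only an approximation: it assumes the bf16 file stores essentially every weight at 16 bits and that GGUF metadata is negligible, so the ratio of file sizes times 16 approximates the average bits per weight. The helper name is hypothetical, and only a subset of the files is listed.

```python
# File sizes in bytes, copied from the Git LFS pointers in this commit.
BF16_SIZE = 17661112576
QUANT_SIZES = {
    "IQ1_S": 2014573088,
    "IQ2_XXS": 2460086816,
    "IQ3_XXS": 3474321952,
    "IQ4_XS": 4785009184,
    "Q4_K_M": 5328957984,
    "Q5_K_M": 6258258464,
    "Q6_K": 7245640224,
    "Q8_0": 9383916064,
}


def approx_bits_per_weight(quant_size, bf16_size=BF16_SIZE):
    """Approximate average bits per weight from the file-size ratio."""
    return 16.0 * quant_size / bf16_size


bpw = {name: approx_bits_per_weight(size) for name, size in QUANT_SIZES.items()}
for name, value in bpw.items():
    print(f"{name:8s} {value:5.2f} bits/weight")
```

With these sizes, Q8\_0 comes out at roughly 8.5 bits per weight and IQ1\_S at a little under 2, which gives a rough guide when choosing a file that fits in available memory.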