OPEA / Qwen2.5-0.5B-Instruct-int4-inc

Safetensors · qwen2 · 4-bit precision · auto-round

Commit b8e3759 · 1 Parent(s): 7cac2d1
Committed by sys-lpot-val

upload auto_round format

Signed-off-by: sys-lpot-val <[email protected]>
Files changed (4)
  1. README.md +28 -21
  2. config.json +2 -2
  3. model.safetensors +2 -2
  4. quantization_config.json +2 -2
README.md CHANGED
@@ -1,6 +1,11 @@
+---
+license: apache-2.0
+datasets:
+- NeelNanda/pile-10k
+---
 ## Model Details
 
-This model is an int4 model with group_size 128 of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) generated by [intel/auto-round](https://github.com/intel/auto-round)
+This model is an int4 model with group_size 128 and symmetric quantization of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) generated by [intel/auto-round](https://github.com/intel/auto-round). Load the model with `revision="7cac2d1"` to use the AutoGPTQ format.
 
 ## How To Use
@@ -16,8 +21,9 @@ tokenizer = AutoTokenizer.from_pretrained(quantized_model_dir)
 
 model = AutoModelForCausalLM.from_pretrained(
     quantized_model_dir,
-    torch_dtype='float16',
+    torch_dtype='auto',
     device_map="auto",
+    ##revision="7cac2d1" ##AutoGPTQ format
 )
 
 ##import habana_frameworks.torch.core as htcore ## uncomment it for HPU
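Since the hunks above expose only slices of the README's usage script, here is a minimal end-to-end sketch reassembled from the fragments visible in this diff. The chat-template calls are the standard transformers API for Qwen instruct models; the generation settings (`max_new_tokens`, `do_sample`) are illustrative assumptions, not values taken from the model card.

```python
# Minimal sketch, reassembled from the diff fragments; not the model card's full script.
from transformers import AutoModelForCausalLM, AutoTokenizer

quantized_model_dir = "OPEA/Qwen2.5-0.5B-Instruct-int4-inc"

tokenizer = AutoTokenizer.from_pretrained(quantized_model_dir)
model = AutoModelForCausalLM.from_pretrained(
    quantized_model_dir,
    torch_dtype='auto',
    device_map="auto",
    ##revision="7cac2d1" ##AutoGPTQ format
)

prompt = "There is a girl who likes adventure,"
# Qwen2.5-Instruct is a chat model, so route the prompt through the chat template.
messages = [{"role": "user", "content": prompt}]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer([text], return_tensors="pt").to(model.device)

# Generation settings here are assumptions for illustration.
output_ids = model.generate(**inputs, max_new_tokens=200, do_sample=False)
# Drop the prompt tokens so only the completion is decoded.
response = tokenizer.decode(output_ids[0][inputs.input_ids.shape[1]:], skip_special_tokens=True)
print(response)
```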
@@ -52,9 +58,8 @@ print(response)
 prompt = "There is a girl who likes adventure,"
 ## INT4:
 """That's great to hear! What kind of adventure does the girl like? Is there anything specific she enjoys doing or exploring?"""
-
 ## BF16:
-"""That's great to hear! What kind of adventure does the girl like? Is there anything specific she enjoys doing or exploring?"""
+"""That's great! What kind of adventure does she like?"""
 
 
 prompt = "9.11和9.8哪个数字大"
@@ -76,41 +81,43 @@ prompt = "9.11和9.8哪个数字大"
 
 最终答案:9.8更大。
 """
+
 ##BF16:
 """
 要比较9.11和9.8的大小,我们可以按照以下步骤进行:
 
-1. 首先,将两个数都转换为相同的小数形式。这里我们使用小数点前的零来方便比较。
-
-   9.11 = 9.1100 (保留两位小数)
-   9.8 = 9.8000 (保留两位小数)
-
-2. 现在,比较这两个小数:
-   - 第一位:9 和 9 相等。
-   - 第二位:第一位是相同的,都是1。
-   - 第三位:第一个数是1,第二个数是8,所以8 > 1。
-
-因此,9.8大于9.11。
-
-最终答案:9.8更大。
+1. **直接比较**:将两个数相减:
+\[
+9.11 - 9.8 = -0.69
+\]
+
+2. **理解结果**:-0.69表示的是一个负数。因为9.11比9.8小。
+
+因此,9.8比9.11大。
 """
 
-
 prompt = "Once upon a time,"
 ##INT4:
 """I'm sorry, but I don't understand what you're asking me to do or what information you want me to provide. Could you please clarify your question or provide more context? I'd be happy to help if you can give me all the information you need."""
 
 ##BF16:
-"""I'm sorry, but I don't understand what you're asking me to do or what information you want me to provide. Could you please clarify your question or provide more context? I'd be happy to help if you can give me all the information you need."""
-
-
-prompt = "请简短介绍一下阿里巴巴公司"
-
+"""once upon a time, there was a young girl named Lily who lived in a small village nestled between two mountains. She had always been fascinated by the natural world and dreamed of exploring it further.
+
+One day, while wandering through the forest, she stumbled upon an old, mysterious book that seemed to have been written on its pages. As she read, she realized that the book contained secrets about the hidden treasures of the earth.
+
+Lily was determined to uncover these secrets and become a true explorer. She spent hours poring over the pages, trying to understand what the author was trying to tell her.
+
+Finally, after many days of research and study, Lily discovered the location of the treasure. It lay deep within the heart of the mountain range, guarded by powerful forces that only those with the right knowledge could reach.
+
+With great excitement, Lily set out on her journey to find the treasure. She traveled for weeks, crossing treacherous terrain and battling fierce beasts along the way. But even as she"""
+
+
+prompt = "请简短介绍一下阿里巴巴公司"
 ##INT4:
 """阿里巴巴集团是全球领先的电子商务和云计算服务提供商,成立于1999年。该公司总部位于中国杭州,并在多个国家和地区设有办事处和运营中心。阿里巴巴集团的业务包括在线零售、移动支付、云计算、人工智能等。阿里巴巴集团是中国最大的电子商务平台之一,也是全球最大的电商平台之一。阿里巴巴集团还拥有众多子公司和品牌,如淘宝、天猫、菜鸟网络等。阿里巴巴集团在全球范围内拥有超过20亿活跃用户,每年销售额超过3500亿美元。阿里巴巴集团致力于通过创新和智能化技术推动商业变革,为消费者提供更便捷、更个性化的购物体验。"""
 
 ##BF16:
-"""阿里巴巴集团是全球领先的电子商务和云计算服务提供商,成立于1999年。该公司总部位于中国杭州,并在多个国家和地区设有办事处和运营中心。阿里巴巴集团的业务包括在线零售、移动支付、云计算、人工智能等。阿里巴巴集团是中国最大的电子商务平台之一,也是全球最大的电商平台之一。阿里巴巴集团还拥有众多子公司和品牌,如淘宝、天猫、菜鸟网络等。阿里巴巴集团在全球范围内拥有超过20亿活跃用户,每年销售额超过3500亿美元。阿里巴巴集团致力于通过创新和智能化技术推动商业变革,为消费者提供更便捷、更个性化的购物体验。"""
+"""阿里巴巴集团是全球最大的电子商务平台之一,成立于1999年。该公司提供包括淘宝、天猫、阿里云等在内的众多产品和服务,是中国乃至全球领先的互联网企业之一。"""
 ```
 
 ### Evaluate the model
@@ -124,9 +131,9 @@ auto-round --model "OPEA/Qwen2.5-0.5B-Instruct-int4-inc" --eval --eval_bs 16 --
 | Metric | BF16 | INT4 |
 | :----------------------------------------- | :----: | :----: |
 | Avg | 0.4229 | 0.4124 |
+| leaderboard_mmlu_pro 5 shots | 0.1877 | 0.1678 |
 | leaderboard_ifeval inst_level_strict_acc | 0.3501 | 0.3441 |
 | leaderboard_ifeval prompt_level_strict_acc | 0.2107 | 0.2218 |
-| leaderboard_mmlu_pro 5 shots | 0.1877 | 0.1678 |
 | mmlu | 0.4582 | 0.4434 |
 | cmmlu | 0.5033 | 0.4542 |
 | ceval-valid | 0.5327 | 0.4918 |
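The evaluation command in the hunk header above is truncated. As a purely hypothetical completion, assuming auto-round's `--tasks` option for selecting lm-eval task names, an invocation covering some of the tabled metrics might look like this; the actual task list behind the table is not shown in this diff.

```bash
# Hypothetical completion of the truncated command; task list is an assumption.
auto-round --model "OPEA/Qwen2.5-0.5B-Instruct-int4-inc" --eval --eval_bs 16 \
    --tasks mmlu,cmmlu,ceval-valid
```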
@@ -145,7 +152,7 @@ auto-round --model "OPEA/Qwen2.5-0.5B-Instruct-int4-inc" --eval --eval_bs 16 --
 
 ### Generate the model
 
-Here is the sample command to reproduce the model. We observed a larger accuracy drop in Chinese tasks and recommend using a high-quality Chinese dataset for calibration or smaller group_size like 32.
+Here is the sample command to generate the model. We observed a larger accuracy drop in Chinese tasks and recommend using a high-quality Chinese dataset for calibration, or a smaller group_size such as 32.
 
 ```bash
 auto-round \
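The quantization command is cut off at the end of this hunk and its remaining arguments are not reproduced here. As a sketch of the group_size recommendation above, assuming auto-round's standard CLI flags (`--bits`, `--group_size`, `--format`, `--output_dir`) and a hypothetical output directory, a regeneration with the smaller group size might look like:

```bash
# Hypothetical variant applying the group_size=32 recommendation;
# the original command's full argument list is not shown in this diff.
auto-round \
    --model Qwen/Qwen2.5-0.5B-Instruct \
    --bits 4 \
    --group_size 32 \
    --format auto_round \
    --output_dir ./Qwen2.5-0.5B-Instruct-int4-gs32
```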
 
config.json CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:96f5d8e1d262852583bf8492ba2c4b8d101db7d0d60f8d3e6c7a42f9b36aa4dc
-size 1367
+oid sha256:8e9dae0baf5b06f44798794cbc7b60eba5588556cca9f507b25cd35a82ac751f
+size 1381
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4e0209a213dc03574cf5f5052e2e4c8726a196bee189b27aea69fb5bcc04cb26
-size 459946568
+oid sha256:4c71884c6fc5f6c12774e9d1d81b94d78f97266ab40cacf1b54c6b0659adcd45
+size 459383592
quantization_config.json CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d603e56eb4bb154a31dc23a83f243fc179aeb8631b8a0639837c3d06b06e8d8b
-size 569
+oid sha256:f92209e21368ef298866e57e5f3838e7590119ba042ef4c15bf642f7f60e4f40
+size 575