Update README.md
README.md
CHANGED
@@ -13,8 +13,6 @@ tags:
 - llama-3
 ---
 
-<img src="https://cdn-uploads.huggingface.co/production/uploads/5df9c78eda6d0311fd3d541f/vlfv5sHbt4hBxb3YwULlU.png" alt="Taiwan LLM Logo" width="600" style="margin-left:'auto' margin-right:'auto' display:'block'"/>
-
 # Llama-3-Taiwan-70B-Instruct - GPTQ
 - Model creator: [Yen-Ting Lin](https://huggingface.co/yentinglin)
 - Original model: [Llama-3-Taiwan-70B-Instruct](https://huggingface.co/yentinglin/Llama-3-Taiwan-70B-Instruct)
@@ -24,19 +22,19 @@ tags:
 
 This repo contains GPTQ model files for [Llama-3-Taiwan-70B-Instruct](https://huggingface.co/yentinglin/Llama-3-Taiwan-70B-Instruct).
 
-Multiple GPTQ parameter permutations are provided; see Provided Files below for details of the options provided, their parameters, and the software used to create them.
-
 <!-- description end -->
 <!-- repositories-available start -->
 * [GPTQ models for GPU inference](minyichen/Llama-3-Taiwan-70B-Instruct-GPTQ)
 * [Yen-Ting Lin's original unquantized model](https://huggingface.co/yentinglin/Llama-3-Taiwan-70B-Instruct)
 <!-- repositories-available end -->
 
-
-
+## Quantization parameter
+
+- Bits : 4
+- Group Size : 128
+- Act Order : Yes
+- Damp % : 0.01
+- Seq Len : 2048
+- Size : 37.07 GB
 
-```
-A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions. USER: {prompt} ASSISTANT:
 
-```
-<!-- prompt-template end -->
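
The quantization parameters added in this change can be captured as a small config object. The sketch below is illustrative only: the README does not say which tool produced the files, so the mapping to AutoGPTQ-style key names (`bits`, `group_size`, `desc_act`, `damp_percent`) is an assumption, as is treating "Seq Len" as a calibration sequence length that sits outside the saved config. The class name `GPTQParams` and its method are hypothetical.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class GPTQParams:
    """Values copied from the README's quantization parameter list."""

    bits: int = 4                # "Bits : 4"
    group_size: int = 128        # "Group Size : 128"
    act_order: bool = True       # "Act Order : Yes"
    damp_percent: float = 0.01   # "Damp % : 0.01"
    seq_len: int = 2048          # "Seq Len : 2048" (assumed: calibration length)

    def to_quantize_config(self) -> dict:
        """Map to AutoGPTQ-style key names (assumed; the README names no tool)."""
        return {
            "bits": self.bits,
            "group_size": self.group_size,
            "desc_act": self.act_order,
            "damp_percent": self.damp_percent,
        }


params = GPTQParams()
# seq_len is deliberately left out: it describes calibration data, not the weights.
print(json.dumps(params.to_quantize_config(), indent=2))
```

Keeping the values in a frozen dataclass makes it easy to compare them against whatever config file ships with the model before loading it.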