Commit
·
f38b5ad
1
Parent(s):
15134f3
update model card and include license file
Browse files- .gitattributes +0 -0
- LICENSE.pdf +0 -0
- README.md +3 -3
- config.json +0 -0
- generation_config.json +0 -0
- hf_quant_config.json +0 -0
- model-00001-of-00003.safetensors +0 -0
- model-00002-of-00003.safetensors +0 -0
- model-00003-of-00003.safetensors +0 -0
- model.safetensors.index.json +0 -0
- special_tokens_map.json +0 -0
- tokenizer.json +0 -0
- tokenizer_config.json +0 -0
.gitattributes
CHANGED
File without changes
|
LICENSE.pdf
ADDED
Binary file (152 kB). View file
|
|
README.md
CHANGED
@@ -10,7 +10,7 @@ This model is ready for commercial and non-commercial use. <br>
|
|
10 |
This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see link to Non-NVIDIA [(Meta-Llama-3.1-8B-Instruct) Model Card](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct).
|
11 |
|
12 |
### License/Terms of Use:
|
13 |
-
[
|
14 |
|
15 |
## Model Architecture:
|
16 |
**Architecture Type:** Transformer <br>
|
@@ -134,8 +134,8 @@ python convert_checkpoint.py --model_dir ./hf_ckpt \
|
|
134 |
* Build engine:
|
135 |
```sh
|
136 |
trtllm-build --checkpoint_dir /trtllm_ckpt --output_dir /engine \
|
137 |
-
--gemm_plugin float16 --speculative_decoding_mode medusa \
|
138 |
-
--max_batch_size 4
|
139 |
```
|
140 |
* Accuracy evaluation:
|
141 |
1) Prepare the MMLU dataset:
|
|
|
10 |
This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see link to Non-NVIDIA [(Meta-Llama-3.1-8B-Instruct) Model Card](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct).
|
11 |
|
12 |
### License/Terms of Use:
|
13 |
+
GOVERNING TERMS: Use of this model is governed by the [NVIDIA Open Models License](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/). ADDITIONAL INFORMATION: [Llama 3.1 Community License Agreement](https://www.llama.com/llama3_1/license/). Built with Meta Llama 3.1.
|
14 |
|
15 |
## Model Architecture:
|
16 |
**Architecture Type:** Transformer <br>
|
|
|
134 |
* Build engine:
|
135 |
```sh
|
136 |
trtllm-build --checkpoint_dir /trtllm_ckpt --output_dir /engine \
|
137 |
+
--gemm_plugin float16 --speculative_decoding_mode medusa \
|
138 |
+
--max_batch_size 4
|
139 |
```
|
140 |
* Accuracy evaluation:
|
141 |
1) Prepare the MMLU dataset:
|
config.json
CHANGED
File without changes
|
generation_config.json
CHANGED
File without changes
|
hf_quant_config.json
CHANGED
File without changes
|
model-00001-of-00003.safetensors
CHANGED
File without changes
|
model-00002-of-00003.safetensors
CHANGED
File without changes
|
model-00003-of-00003.safetensors
CHANGED
File without changes
|
model.safetensors.index.json
CHANGED
File without changes
|
special_tokens_map.json
CHANGED
File without changes
|
tokenizer.json
CHANGED
File without changes
|
tokenizer_config.json
CHANGED
File without changes
|