yeyu-nvidia commited on
Commit
f38b5ad
·
1 Parent(s): 15134f3

update model card and include license file

Browse files
.gitattributes CHANGED
File without changes
LICENSE.pdf ADDED
Binary file (152 kB). View file
 
README.md CHANGED
@@ -10,7 +10,7 @@ This model is ready for commercial and non-commercial use. <br>
10
  This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see link to Non-NVIDIA [(Meta-Llama-3.1-8B-Instruct) Model Card](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct).
11
 
12
  ### License/Terms of Use:
13
- [llama3.1](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B/blob/main/LICENSE)
14
 
15
  ## Model Architecture:
16
  **Architecture Type:** Transformer <br>
@@ -134,8 +134,8 @@ python convert_checkpoint.py --model_dir ./hf_ckpt \
134
  * Build engine:
135
  ```sh
136
  trtllm-build --checkpoint_dir /trtllm_ckpt --output_dir /engine \
137
- --gemm_plugin float16 --speculative_decoding_mode medusa \
138
- --max_batch_size 4
139
  ```
140
  * Accuracy evaluation:
141
  1) Prepare the MMLU dataset:
 
10
  This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see link to Non-NVIDIA [(Meta-Llama-3.1-8B-Instruct) Model Card](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct).
11
 
12
  ### License/Terms of Use:
13
+ GOVERNING TERMS: Use of this model is governed by the [NVIDIA Open Models License](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/). ADDITIONAL INFORMATION: [Llama 3.1 Community License Agreement](https://www.llama.com/llama3_1/license/). Built with Meta Llama 3.1.
14
 
15
  ## Model Architecture:
16
  **Architecture Type:** Transformer <br>
 
134
  * Build engine:
135
  ```sh
136
  trtllm-build --checkpoint_dir /trtllm_ckpt --output_dir /engine \
137
+ --gemm_plugin float16 --speculative_decoding_mode medusa \
138
+ --max_batch_size 4
139
  ```
140
  * Accuracy evaluation:
141
  1) Prepare the MMLU dataset:
config.json CHANGED
File without changes
generation_config.json CHANGED
File without changes
hf_quant_config.json CHANGED
File without changes
model-00001-of-00003.safetensors CHANGED
File without changes
model-00002-of-00003.safetensors CHANGED
File without changes
model-00003-of-00003.safetensors CHANGED
File without changes
model.safetensors.index.json CHANGED
File without changes
special_tokens_map.json CHANGED
File without changes
tokenizer.json CHANGED
File without changes
tokenizer_config.json CHANGED
File without changes