update model card and include license file

Files changed (13) hide show

.gitattributes CHANGED Viewed

File without changes

LICENSE.pdf ADDED Viewed

Binary file (152 kB). View file

README.md CHANGED Viewed

@@ -10,7 +10,7 @@ This model is ready for commercial and non-commercial use.  <br>
 This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see link to Non-NVIDIA [(Meta-Llama-3.1-8B-Instruct) Model Card](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct).
 ### License/Terms of Use:
-[llama3.1](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B/blob/main/LICENSE)
 ## Model Architecture:
 **Architecture Type:** Transformer  <br>
@@ -134,8 +134,8 @@ python convert_checkpoint.py --model_dir ./hf_ckpt \
 * Build engine:
 ```sh
 trtllm-build --checkpoint_dir /trtllm_ckpt --output_dir /engine \
---gemm_plugin float16 --speculative_decoding_mode medusa \
---max_batch_size 4
 ```
 * Accuracy evaluation:
 1) Prepare the MMLU dataset:

 This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see link to Non-NVIDIA [(Meta-Llama-3.1-8B-Instruct) Model Card](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct).
 ### License/Terms of Use:
+GOVERNING TERMS: Use of this model is governed by the [NVIDIA Open Models License](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/). ADDITIONAL INFORMATION: [Llama 3.1 Community License Agreement](https://www.llama.com/llama3_1/license/). Built with Meta Llama 3.1.
 ## Model Architecture:
 **Architecture Type:** Transformer  <br>
 * Build engine:
 ```sh
 trtllm-build --checkpoint_dir /trtllm_ckpt --output_dir /engine \
+    --gemm_plugin float16 --speculative_decoding_mode medusa \
+    --max_batch_size 4
 ```
 * Accuracy evaluation:
 1) Prepare the MMLU dataset:

config.json CHANGED Viewed

File without changes

generation_config.json CHANGED Viewed

File without changes

hf_quant_config.json CHANGED Viewed

File without changes

model-00001-of-00003.safetensors CHANGED Viewed

File without changes

model-00002-of-00003.safetensors CHANGED Viewed

File without changes

model-00003-of-00003.safetensors CHANGED Viewed

File without changes

model.safetensors.index.json CHANGED Viewed

File without changes

special_tokens_map.json CHANGED Viewed

File without changes

tokenizer.json CHANGED Viewed

File without changes

tokenizer_config.json CHANGED Viewed

File without changes