Update README.md
README.md
---
library_name: transformers
base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
license: llama3.1
model-index:
- name: Meta-Llama-3.1-8B-Instruct-INT4
  results: []
language:
- en
- de
- fr
- it
- pt
- hi
- es
- th
tags:
- facebook
- meta
- pytorch
- llama
- llama-3
---

# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->

This is a **4-bit** quantization of `Llama 3.1 8B Instruct`, produced with `bitsandbytes` and `accelerate` (a sketch of a comparable setup is shown below).

- **Developed by:** [More Information Needed]
- **License:** llama3.1
- **Base Model [optional]:** meta-llama/Meta-Llama-3.1-8B-Instruct
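
The card records only that the weights were quantized to 4-bit with `bitsandbytes` and `accelerate`, not the exact configuration used. The following is a minimal sketch of how such a quantized load can look; the NF4 quant type and bfloat16 compute dtype are assumptions, not settings taken from this card:

```
# Hypothetical reproduction sketch -- the exact quantization settings are not
# recorded on this card, so the config values below are assumptions.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # 4-bit weights, as the card states
    bnb_4bit_quant_type="nf4",              # assumed quantization type
    bnb_4bit_compute_dtype=torch.bfloat16,  # assumed compute dtype
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3.1-8B-Instruct",
    quantization_config=bnb_config,
    device_map="auto",  # accelerate places layers across available devices
)
```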

```
# Use a pipeline as a high-level helper
from transformers import pipeline

messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe = pipeline("text-generation", model="meta-llama/Meta-Llama-3.1-8B-Instruct")
pipe(messages)
```

```
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct")
model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct")
```
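
As a usage note, generating from the directly loaded model takes one more step than the snippet above shows. A minimal sketch, where the prompt and `max_new_tokens` value are illustrative rather than taken from the card:

```
# Illustrative follow-up to the load above: apply the chat template and generate.
messages = [{"role": "user", "content": "Who are you?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
outputs = model.generate(inputs, max_new_tokens=64)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```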

Full model information can be found on the original model card: [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct).