just-add-ai
/

Llama-3.3-70B-Instruct-FP8-Dynamic

Text Generation

text-generation-inference

compressed-tensors

Model card Files Files and versions

lpreusser commited on Feb 7

Commit

977506b

·

verified ·

1 Parent(s): fa021e1

Update README.md

Files changed (1) hide show

README.md +22 -11

README.md CHANGED Viewed

@@ -1,17 +1,28 @@
----
-license: llama3.3
-base_model:
-- meta-llama/Llama-3.3-70B-Instruct
-library_name: transformers
-tags:
-- meta
-- llama-3.3
-- fp8-dynamic
----
 ## Quantized Model Information
 > [!IMPORTANT]
 > This repository is a 'FP8-Dynamic' quantized version of [`meta-llama/Llama-3.3-70B-Instruct`](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct), originally released by Meta AI.
-For usage instructions please refer to the original model [`meta-llama/Llama-3.3-70B-Instruct`](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct).

+---
+license: llama3.3
+base_model:
+- meta-llama/Llama-3.3-70B-Instruct
+library_name: transformers
+tags:
+- meta
+- llama-3.3
+- fp8-dynamic
+---
 ## Quantized Model Information
 > [!IMPORTANT]
 > This repository is a 'FP8-Dynamic' quantized version of [`meta-llama/Llama-3.3-70B-Instruct`](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct), originally released by Meta AI.
+For usage instructions please refer to the original model [`meta-llama/Llama-3.3-70B-Instruct`](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct).
+## Performance
+All benchmarks were done using the [`LLM Evaluation Harness`](https://github.com/EleutherAI/lm-evaluation-harness)
+| | | Llama-3.3-70B-Instruct-FP8-Dynamic | Llama-3.3-70B-Instruct (base) | recovery |
+| :---- | :---- | :---: | :---: | :---: |
+| mmlu  | - | xx | xx | xx |
+| | | xx | xx | xx |
+| hellaswag  | acc | 65.69 | - | |
+| | acc_sterr | 0.47 | - | |
+| | acc_norm | 84.36 | - | |
+| | acc_sterr | 0.36 | - | |