lpreusser commited on
Commit
977506b
·
verified ·
1 Parent(s): fa021e1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +22 -11
README.md CHANGED
@@ -1,17 +1,28 @@
1
- ---
2
- license: llama3.3
3
- base_model:
4
- - meta-llama/Llama-3.3-70B-Instruct
5
- library_name: transformers
6
- tags:
7
- - meta
8
- - llama-3.3
9
- - fp8-dynamic
10
- ---
11
 
12
  ## Quantized Model Information
13
 
14
  > [!IMPORTANT]
15
  > This repository is a 'FP8-Dynamic' quantized version of [`meta-llama/Llama-3.3-70B-Instruct`](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct), originally released by Meta AI.
16
 
17
- For usage instructions please refer to the original model [`meta-llama/Llama-3.3-70B-Instruct`](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct).
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3.3
3
+ base_model:
4
+ - meta-llama/Llama-3.3-70B-Instruct
5
+ library_name: transformers
6
+ tags:
7
+ - meta
8
+ - llama-3.3
9
+ - fp8-dynamic
10
+ ---
11
 
12
  ## Quantized Model Information
13
 
14
  > [!IMPORTANT]
15
  > This repository is a 'FP8-Dynamic' quantized version of [`meta-llama/Llama-3.3-70B-Instruct`](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct), originally released by Meta AI.
16
 
17
+ For usage instructions please refer to the original model [`meta-llama/Llama-3.3-70B-Instruct`](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct).
18
+
19
+ ## Performance
20
+ All benchmarks were done using the [`LLM Evaluation Harness`](https://github.com/EleutherAI/lm-evaluation-harness)
21
+ | | | Llama-3.3-70B-Instruct-FP8-Dynamic | Llama-3.3-70B-Instruct (base) | recovery |
22
+ | :---- | :---- | :---: | :---: | :---: |
23
+ | mmlu | - | xx | xx | xx |
24
+ | | | xx | xx | xx |
25
+ | hellaswag | acc | 65.69 | - | |
26
+ | | acc_sterr | 0.47 | - | |
27
+ | | acc_norm | 84.36 | - | |
28
+ | | acc_sterr | 0.36 | - | |