Update README.md
README.md
CHANGED
@@ -13,6 +13,18 @@ base_model: meta-llama/Llama-3.1-70B-Instruct
 pipeline_tag: text-generation
 library_name: transformers
 ---
+
+Original model: https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Reward-HF
+---
+<p><h2>ExLlamaV2 Quantization</h2></p>
+<p>Quantized with <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.2.7">ExLlamaV2 v0.2.7</a></p>
+
+[2.2 Bits Per Weight](https://huggingface.co/UnstableLlama/Llama-3.1-Nemotron-70B-Reward-exl2/tree/2_2)
+
+[4.6 Bits Per Weight](https://huggingface.co/UnstableLlama/Llama-3.1-Nemotron-70B-Reward-exl2/tree/4_6)
+
+
+
 # Model Overview
 
 ## Description:
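The added README lines link each quantization as a branch of the same repository (`2_2`, `4_6`). As a minimal sketch, assuming the branch name is always the bits-per-weight label with the dot replaced by an underscore (true for the two links in this change, not confirmed beyond them), one branch could be fetched with `huggingface_hub`. The helper names `branch_for`, `tree_url`, and `download` are hypothetical and not part of the repository.

```python
# Sketch only: helper names are illustrative, not from the model repository.
REPO_ID = "UnstableLlama/Llama-3.1-Nemotron-70B-Reward-exl2"


def branch_for(bpw: str) -> str:
    """Map a bits-per-weight label such as "4.6" to the branch name "4_6".

    Assumption: branches follow the naming seen in the two README links.
    """
    return bpw.replace(".", "_")


def tree_url(bpw: str) -> str:
    """Build the browser URL for the branch holding a given quantization."""
    return f"https://huggingface.co/{REPO_ID}/tree/{branch_for(bpw)}"


def download(bpw: str, local_dir: str) -> str:
    """Download one quantization branch and return the local path.

    Requires the huggingface_hub package and network access; the import is
    deferred so the helpers above work without it.
    """
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        revision=branch_for(bpw),  # a branch name is a valid revision
        local_dir=local_dir,
    )
```

For example, `download("4.6", "./nemotron-reward-4_6")` would pull only the 4.6 bits-per-weight files rather than every branch.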