Update README.md
Browse files
README.md
CHANGED
@@ -30,7 +30,11 @@ You can access the model at https://arliai.com and ask questions at https://www.
|
|
30 |
|
31 |
The model is available in quantized formats:
|
32 |
|
|
|
|
|
33 |
* **FP16**: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.1
|
|
|
|
|
34 |
* **GGUF**: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.1-GGUF
|
35 |
|
36 |
## Suggested Prompt Format
|
|
|
30 |
|
31 |
The model is available in quantized formats:
|
32 |
|
33 |
+
We recommend using full weights or GPTQ
|
34 |
+
|
35 |
* **FP16**: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.1
|
36 |
+
* **GPTQ_Q4**: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.1-GPTQ_Q4
|
37 |
+
* **GPTQ_Q8**: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.1-GPTQ_Q8
|
38 |
* **GGUF**: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.1-GGUF
|
39 |
|
40 |
## Suggested Prompt Format
|