Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,7 @@
|
|
1 |
---
|
2 |
license: unknown
|
3 |
---
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: unknown
|
3 |
---
|
4 |
+
|
5 |
+
This is a 2.4-bit EXL2 quantization of [Aurelian v0.5 70B 32K](https://huggingface.co/grimulkan/aurelian-v0.5-70b-rope8-32K-fp16), an interim checkpoint before v1.0. See that page for more details.
|
6 |
+
|
7 |
+
This quantization fits in a single 24GB using Exllamav2 & 8-bit cache @ 10K context. It uses the newer experimental quantization method from turboderp.
|