grimulkan committed
Commit f39d558 · verified · 1 Parent(s): ee146b0

Update README.md

Files changed (1)
  1. README.md +4 -0
README.md CHANGED
@@ -1,3 +1,7 @@
  ---
  license: unknown
  ---
+
+ This is a 2.4-bit EXL2 quantization of [Aurelian v0.5 70B 32K](https://huggingface.co/grimulkan/aurelian-v0.5-70b-rope8-32K-fp16), an interim checkpoint before v1.0. See that page for more details.
+
+ This quantization fits on a single 24GB GPU using ExLlamaV2 with an 8-bit cache at 10K context. It uses the newer experimental quantization method from turboderp.
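
The 24GB claim in the added README text can be sanity-checked with a rough back-of-the-envelope estimate. The sketch below is not from the README: it assumes a Llama-2-70B-style architecture (about 70e9 parameters, 80 layers, 8 KV heads, head dimension 128), treats 2.4 bits as a uniform average over all weights, takes 10K context to mean 10,240 tokens, and ignores activation and framework overhead. The function names are hypothetical helpers for illustration only.

```python
# Rough VRAM estimate: quantized weights plus KV cache.
# Every architecture number here is an ASSUMPTION (Llama-2-70B-style),
# not a value stated in the README.

GIB = 1024 ** 3


def weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in bytes."""
    return n_params * bits_per_weight / 8


def kv_cache_bytes(n_layers: int, n_kv_heads: int, head_dim: int,
                   context_len: int, bytes_per_elem: int) -> int:
    """Approximate KV cache size in bytes for a single sequence."""
    # Keys and values are both cached, hence the factor of 2.
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_elem


# Assumed: ~70e9 parameters at an average of 2.4 bits per weight.
weights = weight_bytes(n_params=70e9, bits_per_weight=2.4)

# Assumed: 80 layers, 8 KV heads, head_dim 128, 10,240 tokens, 1 byte per element (8-bit cache).
cache = kv_cache_bytes(n_layers=80, n_kv_heads=8, head_dim=128,
                       context_len=10_240, bytes_per_elem=1)

total_gib = (weights + cache) / GIB
print(f"weights: {weights / GIB:.2f} GiB")
print(f"kv cache: {cache / GIB:.2f} GiB")
print(f"total: {total_gib:.2f} GiB")
```

Under these assumptions the total comes to roughly 21 GiB, which leaves a few GiB of headroom on a 24GB card for activations and other overhead. That is consistent with the README's statement but does not prove it.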