Update README.md
Browse files
README.md
CHANGED
@@ -8,6 +8,6 @@ base_model:
|
|
8 |
IQ4_XS (4.2G, 8.7992 +/- 0.11237, fits into 8GiB VRAM + 4096 context with F16 KV cache)
|
9 |
Q4_K_M (4.6G, 8.7948 +/- 0.11223, fits into 8GiB VRAM + 4096 context with F16 KV cache, also good for CPU inference on E5-26xx v3/v4)
|
10 |
Q8_0 (8.0G, 8.5970 +/- 0.10933, imatrix derived from it)
|
11 |
-
F16 ( 15G, 8.6617 +/- 0.11043, for 24GiB VRAM
|
12 |
```
|
13 |
Perplexity measured with `-fa -c 2048 -ub 2048` on UTF-8 text version of ["Wired Love" from Project Gutenberg](http://www.gutenberg.org/ebooks/24353).
|
|
|
8 |
IQ4_XS (4.2G, 8.7992 +/- 0.11237, fits into 8GiB VRAM + 4096 context with F16 KV cache)
|
9 |
Q4_K_M (4.6G, 8.7948 +/- 0.11223, fits into 8GiB VRAM + 4096 context with F16 KV cache, also good for CPU inference on E5-26xx v3/v4)
|
10 |
Q8_0 (8.0G, 8.5970 +/- 0.10933, imatrix derived from it)
|
11 |
+
F16 ( 15G, 8.6617 +/- 0.11043, for 24GiB VRAM)
|
12 |
```
|
13 |
Perplexity measured with `-fa -c 2048 -ub 2048` on UTF-8 text version of ["Wired Love" from Project Gutenberg](http://www.gutenberg.org/ebooks/24353).
|