tags:
- conversational
- gguf
---

# CybersurferNyandroidLexicat-8x7B-iMat-GGUF

[CybersurferNyandroidLexicat](https://huggingface.co/Envoid/CybersurferNyandroidLexicat-8x7B) quantized from fp16 with love.

Uses the same imatrix calculation method as the later batch of maid-yuzu-v8-alter-iMat-GGUF.

<b>Legacy quants (e.g. Q5_K_M, Q6_K) in this repo have all been enhanced with the imatrix calculation. These quants show improved KL-divergence over their static counterparts.</b>
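KL-divergence here measures how far a quantized model's next-token probability distribution drifts from the fp16 original, with lower meaning the quant tracks the reference more closely. A minimal sketch of that comparison, using small hypothetical probability arrays in place of real model outputs:

```python
import numpy as np

def kl_divergence(p, q, eps=1e-10):
    """KL(p || q) in nats; p = fp16 reference distribution, q = quantized model."""
    p = np.asarray(p, dtype=np.float64)
    q = np.asarray(q, dtype=np.float64)
    # eps guards against log(0) when either distribution assigns zero probability.
    return float(np.sum(p * np.log((p + eps) / (q + eps))))

# Hypothetical next-token distributions over a tiny 4-token vocabulary.
fp16_probs  = [0.70, 0.20, 0.08, 0.02]   # reference fp16 model
quant_probs = [0.65, 0.23, 0.09, 0.03]   # quantized model, slightly drifted

print(kl_divergence(fp16_probs, quant_probs))  # small positive value = little drift
```

In practice this comparison is run per token over a large evaluation text and averaged; a quant with lower mean KL-divergence reproduces the fp16 model's behavior more faithfully.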

All files are included here for your convenience. No need to clone the entire repo; just pick the quant that's right for you.

For more information on the latest iMatrix quants, see this PR: https://github.com/ggerganov/llama.cpp/pull/5747