Update README.md
README.md (CHANGED)

Removed:
<div style="border: 2px solid #c4382d; border-left-width: 8px; padding: 16px 32px; border-radius: 0 10px 10px 0;">
<h3>Known Issue:</h3>
The initial version of <a href="https://huggingface.co/mistralai/Codestral-22B-v0.1">mistralai/Codestral-22B-v0.1</a> was missing the special tokens for FIM (Fill In the Middle); see the <a href="https://huggingface.co/mistralai/Codestral-22B-v0.1/discussions/10#665856e3d3e05bb21be16140">discussion</a>.
<h3>Fixed here:</h3>
<ul>
<li><a href="https://huggingface.co/legraphista/Codestral-22B-v0.1-hf-FIM-fix-IMat-GGUF">legraphista/Codestral-22B-v0.1-hf-FIM-fix-IMat-GGUF</a></li>
<li><a href="https://huggingface.co/legraphista/Codestral-22B-v0.1-hf-FIM-fix">legraphista/Codestral-22B-v0.1-hf-FIM-fix</a></li>
</ul>
</div>
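To make the missing-token issue concrete, here is a minimal FIM sketch, assuming Mistral's suffix-first prompt layout (`<s>[SUFFIX]{suffix}[PREFIX]{prefix}`, with the model generating the middle) and an illustrative local quant filename; neither the filename nor the exact command is taken from this card:

```bash
# Minimal FIM sketch. Assumptions: Mistral's suffix-first FIM prompt layout
# and the filename Codestral-22B-v0.1.Q8_0.gguf (illustrative placeholder).
# With the FIM-fix weights, [SUFFIX]/[PREFIX] map to single control tokens;
# with the original upload they tokenize as literal text, so the model
# cannot fill in the middle. Expected completion here: "result = a + b".
llama.cpp/main \
    -m Codestral-22B-v0.1.Q8_0.gguf \
    -p '<s>[SUFFIX]    return result[PREFIX]def add(a, b):' \
    -n 32
```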
Updated:
- static
---
# Codestral-22B-v0.1-IMat-GGUF
_Llama.cpp imatrix quantization of mistralai/Codestral-22B-v0.1 (bullerwins/Codestral-22B-v0.1-hf)_

Original model: [mistralai/Codestral-22B-v0.1](https://huggingface.co/mistralai/Codestral-22B-v0.1)
Quantized HF Model: [legraphista/Codestral-22B-v0.1-hf-FIM-fix](https://huggingface.co/legraphista/Codestral-22B-v0.1-hf-FIM-fix)
Original dtype: `BF16` (`bfloat16`)
Quantized by: llama.cpp [b3046](https://github.com/ggerganov/llama.cpp/releases/tag/b3046)
IMatrix dataset: [here](https://gist.githubusercontent.com/legraphista/d6d93f1a254bcfc58e0af3777eaec41e/raw/d380e7002cea4a51c33fffd47db851942754e7cc/imatrix.calibration.medium.raw)

- [Files](#files)
  - [IMatrix](#imatrix)
  - [Common Quants](#common-quants)
  - [All Quants](#all-quants)
- [Downloading using huggingface-cli](#downloading-using-huggingface-cli)
- [Inference](#inference)
  - [Simple chat template](#simple-chat-template)
  - [Chat template with system prompt](#chat-template-with-system-prompt)
  - [FIM / Fill In the Middle](#fim-fill-in-the-middle)
  - [Llama.cpp](#llama-cpp)
- [FAQ](#faq)
  - [Why is the IMatrix not applied everywhere?](#why-is-the-imatrix-not-applied-everywhere)
  - [How do I merge a split GGUF?](#how-do-i-merge-a-split-gguf)
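As a quick pointer for the huggingface-cli section linked in the contents above, a minimal download sketch; the repo id is inferred from this card's title and the quant filename is an assumed example (the Files section lists the real ones):

```bash
# Assumed repo id (inferred from the card title) and an assumed quant
# filename; check the Files section of this card for the actual names.
pip install -U "huggingface_hub[cli]"
huggingface-cli download legraphista/Codestral-22B-v0.1-IMat-GGUF \
    --include "Codestral-22B-v0.1.Q8_0.gguf" \
    --local-dir ./
```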
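Likewise for the split-GGUF FAQ entry above, a sketch using llama.cpp's gguf-split tool; point it at the first shard and it locates the rest (all filenames here are placeholders):

```bash
# Merge a split GGUF back into a single file with llama.cpp's gguf-split.
# Shard and output names are illustrative placeholders.
gguf-split --merge \
    Codestral-22B-v0.1.Q8_0-00001-of-00002.gguf \
    Codestral-22B-v0.1.Q8_0.gguf
```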
---