Lewdiculous
/

L3-8B-Stheno-v3.2-GGUF-IQ-Imatrix

Model card Files Files and versions Community

Lewdiculous commited on Jun 7, 2024

Commit

656f88a

·

verified ·

1 Parent(s): 18584ec

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -24,7 +24,7 @@ My GGUF-IQ-Imatrix quants for [Sao10K/L3-8B-Stheno-v3.2](https://huggingface.co/
 > [!NOTE]
 > **General usage:** <br>
-> Use the latest version of **KoboldCpp**. <br>
 > For **8GB VRAM** GPUs, I recommend the **Q4_K_M-imat** (4.89 BPW) quant for up to 12288 context sizes. <br>
 >
 > **Presets:** <br>

 > [!NOTE]
 > **General usage:** <br>
+> Use the [**latest version of KoboldCpp**](https://github.com/LostRuins/koboldcpp/releases/latest). <br>
 > For **8GB VRAM** GPUs, I recommend the **Q4_K_M-imat** (4.89 BPW) quant for up to 12288 context sizes. <br>
 >
 > **Presets:** <br>