DavidAU/Mistral-Small-Instruct-2409-22B-NEO-Imatrix-GGUF

DavidAU committed on Sep 19, 2024

Commit 9b969cf · verified · 1 parent: d7b6b9b

Update README.md

Files changed (1)

README.md +5 -4

@@ -46,10 +46,7 @@ pipeline_tag: text-generation
 It is the new "Mistral-Small-Instruct 2409 22B", max context of 131,000 (128k) with the NEO IMATRIX dataset.
-This model IS bullet proof and operates with all parameters, including temp settings from 0 to 5.
-It is an extraordinary compressed model at a PPL level of 4.8611 +/- 0.06701 (Q4_K_M).
 The NEO IMATRIX dataset V2 was applied to it to enhance creativity.
 4 examples provided to show differences at TEMP=0 and at TEMP=1 for both non-imatrix and NEO imatrix versions.
@@ -64,6 +61,10 @@ Please refer to the original model card for this model from MistralAI for additi
 Imatrix quants perform best at IQ3s and IQ4s, then Q4s, lower on Q5, and tappers off at Q6.
 Due to the parameter count of this model, even IQ2s quants will work very well.
 Q8 is not uploaded here because Imatrix has no effect on this quant.

 It is the new "Mistral-Small-Instruct 2409 22B", max context of 131,000 (128k) with the NEO IMATRIX dataset.
+This model IS bullet proof and operates with all parameters, including temp settings from 0 to 5. It is an extraordinary compressed model at a PPL level of 4.8611 +/- 0.06701 (Q4_K_M).
 The NEO IMATRIX dataset V2 was applied to it to enhance creativity.
 4 examples provided to show differences at TEMP=0 and at TEMP=1 for both non-imatrix and NEO imatrix versions.
 Imatrix quants perform best at IQ3s and IQ4s, then Q4s, lower on Q5, and tappers off at Q6.
+Recommend: IQ4_XS for maximum imatrix effect and best "bit count".
+For stronger IMATRIX effect, IQ3s, and IQ2s.
 Due to the parameter count of this model, even IQ2s quants will work very well.
 Q8 is not uploaded here because Imatrix has no effect on this quant.