Benchmarks

Perplexity measured with: llama-perplexity -m E:\text-generation-webui\models\google_gemma-3-4b-it-qat-q4_0-unquantized_CHOSENQUANT.gguf -f wiki.test.raw -fa -mg 0 -ngl 150 -ts 40,0,0 -b 512 --no-mmap -c 512 (CHOSENQUANT stands for each of the quants listed below)

  • BF16: PPL = 15.1898 +/- 0.14353
  • For pure Q4_0 (imat): PPL = 15.4831 +/- 0.14487
  • For pure IQ4_XS (imat): PPL = 14.5142 +/- 0.13311 (!!!)
  • For pure IQ4_KS (imat): PPL = 15.4225 +/- 0.14537
  • For pure Q4_K (imat): PPL = 14.9259 +/- 0.13876
  • For pure IQ4_NL (imat): PPL = 14.6848 +/- 0.13511
  • For pure IQ4_K (imat): PPL = 15.5956 +/- 0.14764
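
For reproducing the sweep above, here is a minimal sketch that loops the same llama-perplexity invocation over each quant and pulls out the reported PPL. The model directory, per-quant file names, and the "Final estimate: PPL = ..." output line are assumptions, not taken from this card; adjust them to your local files.

```python
# Hypothetical sweep script: run the same llama-perplexity command for each quant
# and collect the final perplexity line. File naming below is an assumption.
import re
import subprocess
from pathlib import Path

MODEL_DIR = Path(r"E:\text-generation-webui\models")  # assumed, from the command above
QUANTS = ["BF16", "Q4_0", "IQ4_XS", "IQ4_KS", "Q4_K", "IQ4_NL", "IQ4_K"]

for quant in QUANTS:
    # Hypothetical per-quant file name pattern.
    model = MODEL_DIR / f"google_gemma-3-4b-it-qat-q4_0-unquantized_{quant}.gguf"
    cmd = [
        "llama-perplexity",
        "-m", str(model),
        "-f", "wiki.test.raw",
        "-fa",              # flash attention
        "-mg", "0",         # main GPU 0
        "-ngl", "150",      # offload all layers
        "-ts", "40,0,0",    # tensor split: everything on GPU 0
        "-b", "512",
        "--no-mmap",
        "-c", "512",        # 512-token context, as used for the numbers above
    ]
    out = subprocess.run(cmd, capture_output=True, text=True)
    # Assumes the tool prints a line like "Final estimate: PPL = 15.1898 +/- 0.14353";
    # search both streams in case the build logs it to stderr.
    match = re.search(r"PPL = [\d.]+ \+/- [\d.]+", out.stdout + out.stderr)
    print(f"{quant}: {match.group(0) if match else 'no PPL line found'}")
```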