Commit History

add perplexity comparison with intel auto-round quant
de2a086

ubergarm commited on

still testing vulkan backend added note
7a18aa9

ubergarm commited on

add perplexity graph
ebceac5

ubergarm commited on

Add small GPU optimized KT quants and quickstart
c6f5281

ubergarm commited on

Upload Qwen3-30B-A3B-Instruct-2507-IQ2_KT.gguf with huggingface_hub
cf52545
verified

ubergarm commited on

Upload Qwen3-30B-A3B-Instruct-2507-IQ1_KT.gguf with huggingface_hub
169c02a
verified

ubergarm commited on

Upload Qwen3-30B-A3B-Instruct-2507-IQ2_KL.gguf with huggingface_hub
f18ac32
verified

ubergarm commited on

Add IQ2_KL
b1c3767

ubergarm commited on

Upload Qwen3-30B-A3B-Instruct-2507-IQ3_KS.gguf with huggingface_hub
c81859d
verified

ubergarm commited on

Upload Qwen3-30B-A3B-Instruct-2507-IQ3_K.gguf with huggingface_hub
86b7200
verified

ubergarm commited on

Add IQ3_K and IQ3_KS
f67fb1e

ubergarm commited on

Upload Qwen3-30B-A3B-Instruct-2507-IQ4_KSS.gguf with huggingface_hub
6b3a920
verified

ubergarm commited on

Upload Qwen3-30B-A3B-Instruct-2507-IQ4_K.gguf with huggingface_hub
8c72d73
verified

ubergarm commited on

Add IQ4_K and IQ4_KSS
297f6bc

ubergarm commited on

Upload imatrix-eaddario-combined-all-medium-Qwen3-30B-A3B-Instruct-2507-BF16.dat with huggingface_hub
c5c524d
verified

ubergarm commited on

Upload Qwen3-30B-A3B-Instruct-2507-IQ5_K.gguf with huggingface_hub
001b3c8
verified

ubergarm commited on

upload IQ5_K
1ffa6d4

ubergarm commited on

initial commit
a66cdac

ubergarm commited on

initial commit
4ec6856
verified

ubergarm commited on