GLM-4.5-Air-GGUF / README.md

Commit History

fixup readme
e0bb40e

ubergarm commited on

Release IQ3_KS with perplexity info
fa40640

ubergarm commited on

Support added into ik_llama.cpp main branch now yay!
482e5b8

ubergarm commited on

Add IQ1_KT (with iq4_nl ffn_down_exps lmao)
14e974e

ubergarm commited on

update readme with IQ4_K and IQ5_KS
c738943

ubergarm commited on

Uploading IQ4_K and IQ5_KSS
726f6c5

ubergarm commited on

fixup IQ4_KSS recipe
d968e06

ubergarm commited on

add recipes and perplexity graph
1ae325a

ubergarm commited on

fixup readme
4ab18fb

ubergarm commited on

Upload IQ2_KL and add perplexity values
6eb7b85

ubergarm commited on

Add IQ5_K 6BPW with similar PPL as Q8_0
d047f0a

ubergarm commited on

Add --chat-template chatglm4 again for /nothink
607ffea

ubergarm commited on

Add BF16 Perplexity
e3c5f49

ubergarm commited on

add two perplexity numbers
50de40a

ubergarm commited on

Updating imatrix and IQ4_KSS
874fab4

ubergarm commited on

May need --override-kv tokenizer.ggml.eot_token_id=int:151336
f2eb604

ubergarm commited on

Update WARNING EXPERIMENTAL quant instructions
6aa5bab

ubergarm commited on

WARNING EXPERIMENTAL IQ4_KSS
30a0b69

ubergarm commited on

add BF16 and Q8_0 size info
bc32e07

ubergarm commited on

initial commit
6a83919
verified

ubergarm commited on