GLM-4.5-Air-GGUF / README.md

Commit History

fixup readme

e0bb40e

ubergarm committed 29 days ago

Release IQ3_KS with perplexity info

fa40640

ubergarm committed 29 days ago

Support added into ik_llama.cpp main branch now yay!

482e5b8

ubergarm committed about 1 month ago

Add IQ1_KT (with iq4_nl ffn_down_exps lmao)

14e974e

ubergarm committed about 1 month ago

update readme with IQ4_K and IQ5_KS

c738943

ubergarm committed about 1 month ago

Uploading IQ4_K and IQ5_KSS

726f6c5

ubergarm committed about 1 month ago

fixup IQ4_KSS recipe

d968e06

ubergarm committed about 1 month ago

add recipes and perplexity graph

1ae325a

ubergarm committed about 1 month ago

fixup readme

4ab18fb

ubergarm committed on Aug 6

Upload IQ2_KL and add perplexity values

6eb7b85

ubergarm committed on Aug 6

Add IQ5_K 6BPW with similar PPL as Q8_0

d047f0a

ubergarm committed on Aug 6

Add --chat-template chatglm4 again for /nothink

607ffea

ubergarm committed on Aug 5

Add BF16 Perplexity

e3c5f49

ubergarm committed on Aug 5

add two perplexity numbers

50de40a

ubergarm committed on Aug 5

Updating imatrix and IQ4_KSS

874fab4

ubergarm committed on Aug 5

May need --override-kv tokenizer.ggml.eot_token_id=int:151336

f2eb604

ubergarm committed on Aug 3

Update WARNING EXPERIMENTAL quant instructions

6aa5bab

ubergarm committed on Aug 2

WARNING EXPERIMENTAL IQ4_KSS

30a0b69

ubergarm committed on Aug 2

add BF16 and Q8_0 size info

bc32e07

ubergarm committed on Aug 2

initial commit

6a83919
verified

ubergarm committed on Aug 2
