Quantization Config

#2
by chriswritescode

AWQModifier(
    ignore=["lm_head", "re:.*mlp.gate$", "re:.*mlp.shared_expert_gate$"],
    ...
)
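For reference, an ignore list like this usually sits inside an llm-compressor one-shot AWQ recipe roughly along these lines; the scheme, model id, and calibration settings below are my assumptions, not the actual values used for this checkpoint:

# Minimal sketch of an llm-compressor AWQ recipe (assumed settings, not the original ones).
from llmcompressor import oneshot
from llmcompressor.modifiers.awq import AWQModifier

recipe = [
    AWQModifier(
        # Modules matching these names/regexes are skipped by the modifier,
        # so lm_head and the MoE router gates would keep the checkpoint's
        # original dtype (e.g. bf16) instead of being quantized.
        ignore=["lm_head", "re:.*mlp.gate$", "re:.*mlp.shared_expert_gate$"],
        scheme="W4A16_ASYM",  # assumed 4-bit weight scheme, not confirmed
        targets=["Linear"],   # quantize the remaining Linear layers
    ),
]

oneshot(
    model="placeholder/model-id",  # hypothetical model id
    dataset="open_platypus",       # hypothetical calibration dataset
    recipe=recipe,
    max_seq_length=2048,
    num_calibration_samples=256,
)

If the quant was produced this way, the ignored gate modules would not be quantized at all and would stay in whatever dtype the source checkpoint used.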

What method was applied? Was the gate kept in bf16?

No idea, I did not make this quant; it was mirrored from ModelScope.
