AngelSlim/Qwen2_5-32B_instruct_fp8_static

woodchen7 committed on Jul 23

Commit be2f791 (verified) · 1 parent: 3714224

Upload hf_quant_config.json with huggingface_hub

Files changed (1)

hf_quant_config.json ADDED

+{
+    "quantization": {
+        "quant_algo": "FP8",
+        "kv_cache_quant_algo": null,
+        "exclude_modules": [
+            "lm_head",
+            "model.embed_tokens"
+        ]
+    }
+}
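
To illustrate how a consumer might read this file, here is a minimal Python sketch that parses the JSON added above and checks whether a given module falls under the `exclude_modules` list. The helper `is_quantized` and the matching rule (exact name or dotted prefix) are assumptions made for illustration; the commit does not say how any particular loader interprets these fields, and the module name `model.layers.0.mlp.down_proj` is only an example.

```python
import json

# Contents of hf_quant_config.json exactly as added in this commit.
CONFIG_TEXT = """
{
    "quantization": {
        "quant_algo": "FP8",
        "kv_cache_quant_algo": null,
        "exclude_modules": [
            "lm_head",
            "model.embed_tokens"
        ]
    }
}
"""


def is_quantized(module_name, quant_cfg):
    """Hypothetical helper: report whether a module would be quantized.

    Assumption: a module is excluded when its name equals an entry in
    exclude_modules or sits beneath it (dotted prefix). Real loaders may
    use a different matching rule.
    """
    for excluded in quant_cfg.get("exclude_modules") or []:
        if module_name == excluded or module_name.startswith(excluded + "."):
            return False
    return True


quant_cfg = json.loads(CONFIG_TEXT)["quantization"]

print("weight algorithm:", quant_cfg["quant_algo"])
print("KV cache algorithm:", quant_cfg["kv_cache_quant_algo"])
for name in ("lm_head", "model.embed_tokens", "model.layers.0.mlp.down_proj"):
    print(name, "quantized" if is_quantized(name, quant_cfg) else "excluded")
```

Under these assumptions, `lm_head` and `model.embed_tokens` are reported as excluded, and the JSON `null` for `kv_cache_quant_algo` is parsed as Python `None`.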