tencent
/

Hunyuan-4B-Instruct-FP8

Text Generation

hunyuan_v1_dense

compressed-tensors

Model card Files Files and versions Community

manaestras commited on 6 days ago

Commit

428085f

·

verified ·

1 Parent(s): 0625145

Upload hf_quant_config.json with huggingface_hub

Files changed (1) hide show

hf_quant_config.json +10 -0

hf_quant_config.json ADDED Viewed

	@@ -0,0 +1,10 @@

+{
+    "quantization": {
+        "exclude_modules": [
+            "lm_head",
+            "model.embed_tokens"
+        ],
+        "kv_cache_quant_algo": null,
+        "quant_algo": "FP8"
+    }
+}