tencent
/

Hunyuan-0.5B-Instruct-FP8

hunyuan_v1_dense

compressed-tensors

Model card Files Files and versions Community

Hunyuan-0.5B-Instruct-FP8 / hf_quant_config.json

manaestras's picture

Upload hf_quant_config.json with huggingface_hub

25718c2 verified 6 days ago

history blame contribute delete

192 Bytes

	{
	"quantization": {
	"exclude_modules": [
	"lm_head",
	"model.embed_tokens"
	],
	"kv_cache_quant_algo": null,
	"quant_algo": "FP8"
	}
	}