hf_quant_config.json
{
"quantization": {
"quant_algo": "FP8",
"kv_cache_quant_algo": null,
"exclude_modules": [
"lm_head",
"model.embed_tokens"
]
}
}
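The config declares FP8 weight quantization, leaves the KV cache unquantized (kv_cache_quant_algo is null), and exempts lm_head and model.embed_tokens so they stay in their original precision. Below is a minimal Python sketch of how a loader might interpret the file; load_quant_config and should_quantize are illustrative helpers (not part of any library), and treating exclude_modules entries as glob patterns is an assumption.

import json
from fnmatch import fnmatch

def load_quant_config(path="hf_quant_config.json"):
    # Read the "quantization" block from the JSON file shown above.
    with open(path) as f:
        return json.load(f)["quantization"]

def should_quantize(module_name, cfg):
    # A module's weights are quantized only if a quant_algo is set and the
    # module is not listed in exclude_modules (here: lm_head, model.embed_tokens).
    if cfg.get("quant_algo") is None:
        return False
    return not any(fnmatch(module_name, pat) for pat in cfg.get("exclude_modules", []))

if __name__ == "__main__":
    cfg = load_quant_config()
    print("weight quantization:", cfg["quant_algo"])            # FP8
    print("KV-cache quantization:", cfg["kv_cache_quant_algo"])  # None
    for name in ["model.layers.0.self_attn.q_proj", "lm_head", "model.embed_tokens"]:
        print(name, "->", "quantize" if should_quantize(name, cfg) else "skip")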