numen-tech
/

DeepSeek-R1-Distill-Llama-8B-w4a16g128asym

Text Generation

4-bit precision

Model card Files Files and versions Community

4-bit OmniQuant quantized version of DeepSeek-R1-Distill-Llama-8B for inference with the Private LLM app.

Downloads last month: -

Model tree for numen-tech/DeepSeek-R1-Distill-Llama-8B-w4a16g128asym

Base model

deepseek-ai/DeepSeek-R1-Distill-Llama-8B

Quantized

(178)

this model