4-bit OmniQuant quantized version of DeepSeek-R1-Distill-Llama-8B for inference with the Private LLM app.

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for numen-tech/DeepSeek-R1-Distill-Llama-8B-w4a16g128asym

Quantized
(178)
this model