@danielhanchen on Hugging Face: "Run DeepSeek-V3.1 locally on 170GB RAM with Dynamic 1-bit GGUFs!🐋 GGUFs:…"

Post

6503

Run DeepSeek-V3.1 locally on 170GB RAM with Dynamic 1-bit GGUFs!🐋
GGUFs: unsloth/DeepSeek-V3.1-GGUF

The 715GB model gets reduced to 170GB (-80% size) by smartly quantizing layers.

The 1-bit GGUF passes all our code tests & we fixed the chat template for llama.cpp supported backends.

Guide: https://docs.unsloth.ai/basics/deepseek-v3.1

Join the conversation