Update README.md
README.md
CHANGED
@@ -13,9 +13,11 @@ tags:
 
 # Model Card
 
-Dynamic quantization of DeepSeek-R1-0528
+Dynamic quantization of DeepSeek-R1-0528 using optimized format only available on **ik_llama** fork, resized to run with 24GB to 32GB VRAM and 512GB RAM systems while providing the best balance between quality and performance for coding.
 
-THIS QUANT ONLY
+THIS QUANT USES OPTIMIZED FORMAT THAT IS ONLY AVAILABLE ON **IK_LLAMA** FORK!!!
+
+Note that **ik_llama** can run all the **llama.cpp** quants, while adding support for interleaved formats (_R4 and _R8).
 
 See [this detailed guide](https://github.com/ikawrakow/ik_llama.cpp/discussions/258) on how to setup an run **ik_llama**.
 