Update README.md
README.md CHANGED
```diff
@@ -28,7 +28,7 @@ Run them directly with [llama.cpp](https://github.com/ggerganov/llama.cpp), or a
 
 I finally decided to go through llama-quant.cpp and update some of the tensor types, especially for MoE models, since they've largely been left as-is since the original Mixtral.
 
-These changes apply a bit more logic to the types, bumping a few values here and there across the board, and seem to have an overall positive impact on the results.
+These changes apply a bit more logic to the types, bumping a few values here and there across the board, and seem to have an overall positive impact on the results. They're similar to what Unsloth accomplished, but done in a more generic (and hopefully upstreamable) way.
 
 IQ2_XXS may not be final; the size increase is quite substantial, so I may want to claw it back a bit to keep it in a better spot. I'm still working on it, but wanted to explain these new uploads. A PR to llama.cpp will be opened when I'm done investigating.
 
```