phi0112358/DeepSeek-V2-Lite-Chat-Q4_0-GGUF
Updated
•
30
GGUFs, conventional and k-quants – both without imatrix. This should be faster for CPU inference. Right now DeepSee MoEs (Mixture of Experts)