Custom GGUF quants of NousResearch/DeepHermes-3-Mistral-24B-Preview, where the output tensors and embeddings are either kept at F32 or quantized to Q8_0. Enjoy! 🧠🔥🚀
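For reference, quants like these can be produced with llama.cpp's `llama-quantize` tool by overriding the types of the output tensor and token embeddings while the rest of the model uses the base quant type. The sketch below is a minimal example, not the exact recipe used for this repo; the input/output GGUF file names are hypothetical, and you can swap `q8_0` for `f32` to keep those tensors at full precision.

```python
# Minimal sketch: producing a custom quant with llama.cpp's llama-quantize,
# keeping the output tensor and token embeddings at Q8_0 while the remaining
# tensors are quantized to Q4_K_M. File names below are hypothetical.
import subprocess

subprocess.run(
    [
        "./llama-quantize",                              # llama.cpp quantization binary
        "--output-tensor-type", "q8_0",                  # output tensor at Q8_0 (or "f32" to keep F32)
        "--token-embedding-type", "q8_0",                # token embeddings at Q8_0 (or "f32" to keep F32)
        "DeepHermes-3-Mistral-24B-Preview-F16.gguf",     # hypothetical full-precision input GGUF
        "DeepHermes-3-Mistral-24B-Preview-Q4_K_M.gguf",  # hypothetical quantized output GGUF
        "Q4_K_M",                                        # base quant type for all other tensors
    ],
    check=True,
)
```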