bartowski/TheDrummer_GLM-Steam-106B-A12B-v1-GGUF Text Generation • 0.1B • Updated 5 days ago • 4.89k • 6
bartowski/NousResearch_Hermes-4-70B-GGUF Text Generation • 0.0B • Updated 6 days ago • 2.54k • 5
PocketDoc/Dans-PersonalityEngine-V1.3.0-24b Text Generation • 24B • Updated May 23 • 1.74k • 75
MMLU Pro benchmark for GGUFs (1 shot) Collection "Not all quantized models perform well"; serving frameworks: Ollama uses an NVIDIA GPU, llama.cpp uses the CPU with AVX & AMX • 13 items • Updated 17 days ago • 8
mradermacher/Huihui-Qwen3-30B-A3B-Thinking-2507-abliterated-i1-GGUF 31B • Updated 29 days ago • 3.92k • 5
mradermacher/Huihui-Qwen3-30B-A3B-Instruct-2507-abliterated-i1-GGUF 31B • Updated 29 days ago • 9.4k • 8