Prompt time (Ollama) on 22-core Xeon, 5070 Ti, 128 GB RAM (Q6_K_L) (#12, opened 25 days ago by MikeZeroTango)
Template bug fixed in llama.cpp (#11, opened about 1 month ago by matteogeniaccio, 5 replies)
vllm deployment error (#10, opened about 1 month ago by Saicy, 1 reply)
Higher than usual refusal rate with Q6_K_L quant GGUF (#9, opened about 1 month ago by smcleod, 3 replies)
Tool use? (#8, opened about 1 month ago by johnpyp, 2 replies)
llama.cpp fixes have just been merged (#5, opened about 1 month ago by Mushoz, 21 replies)
LM Studio: unknown model architecture: 'glm4'? (#4, opened about 1 month ago by DrNicefellow, 5 replies)
Please regenerate GGUFs (#3, opened about 1 month ago by jacek2024, 1 reply)
Broken results (#2, opened about 2 months ago by RamoreRemora, 8 replies)
YaRN quantization for long context (#1, opened about 2 months ago by sovetboga, 1 reply)