This fixes a template bug in llama.cpp and ollama that is causing a performance degradation in all GLM models. The problem is caused by a wrong template detection in both projects. https://github.com/ollama/ollama/issues/10408 https://github.com/ggml-org/llama.cpp/pull/13099