Update README.md

https://github.com/turboderp-org/exllamav3/commit/c44e56c73b2c67eee087c7195c9093520494d3bf
https://github.com/turboderp-org/exllamav2/commit/de19cbcc599353d5aee1fec8c1ce2806f890baca
https://huggingface.co/Disya/GLM4-9B-Neon-v2-exl2-5.5bpw-h8
exl3 and exl2 now support GLM4, so the README note saying otherwise is removed.

Files changed (1)

README.md +1 -1

@@ -58,7 +58,7 @@ To run GGUFs correctly, you need the most recent version of KoboldCPP, and to pa
 Thanks to DaringDuck and tofumagnate for info how to apply this fix.
-To run this model on vLLM, you'll need to build it from source from the git repo, GLM4 support haven't reached release yet. ExLLaMAv2 and v3 don't support GLM4 arch at the moment
+To run this model on vLLM, you'll need to build it from source from the git repo, GLM4 support haven't reached release yet.
 ---