Update README.md
README.md (CHANGED)
```diff
@@ -58,7 +58,11 @@ To run GGUFs correctly, you need the most recent version of KoboldCPP, and to pa
 
 Thanks to DaringDuck and tofumagnate for info on how to apply this fix.
 
-To run this model on vLLM, you'll need to build it from source from the git repo, GLM4 support
+To run this model on vLLM, you'll need to build it from source from the git repo; full GLM4 support hasn't reached a release yet.
+
+ExLLaMAv2- and v3-based backends, such as TabbyAPI, should support the model out of the box.
+
+The latest versions of the llama.cpp server should also run GGUFs out of the box.
 
 ---
 
```
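For reference, here is a minimal sketch of the vLLM route the new text describes, using vLLM's offline `LLM` API. This is an illustration under stated assumptions, not part of the commit: it presumes vLLM has already been built from a source checkout (e.g. `pip install -e .` inside the cloned repo, per vLLM's build docs), and the model path below is a placeholder, not this repo's actual name.

```python
# Hedged sketch: offline inference with a source-built vLLM.
# Assumptions: GLM4 support is present in the source build;
# "./glm4-model" is a placeholder path or HF repo id.
from vllm import LLM, SamplingParams

llm = LLM(model="./glm4-model")  # placeholder model location
params = SamplingParams(temperature=0.7, max_tokens=128)

outputs = llm.generate(["Hello, how are you?"], params)
print(outputs[0].outputs[0].text)
```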