---
tags:
- merge
- Roleplay
---
# Experimental 3x7B model

An experimental MoE model customized for all-round roleplay. It understands character cards well and has strong logic.

# It's ridiculous that I can run the original version in 4-bit but can't run the GGUF version. I hope someone could help me fix the GGUF version.

This is the error line I get when trying to load the GGUF version of this model:

``GGML_ASSERT: /home/runner/work/llama-cpp-python-cuBLAS-wheels/llama-cpp-python-cuBLAS-wheels/vendor/llama.cpp/ggml-cuda.cu:8431: (ncols & (ncols - 1)) == 0``

I have tried everything from Q2 to fp16, no hope. 😥
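For what it's worth, the failing condition is a standard power-of-two check. A minimal sketch of what the assert tests (the idea that the 3-expert count of a 3x7B MoE is what trips it is my assumption, not a confirmed diagnosis):

```python
# `(n & (n - 1)) == 0` holds only when n is a power of two: such a number
# has exactly one bit set, and subtracting 1 flips that bit while setting
# every lower bit, so the bitwise AND comes out 0.
def is_power_of_two(n: int) -> bool:
    return n > 0 and (n & (n - 1)) == 0

# 3 (as in a 3-expert "3x7B" MoE) is not a power of two, while 2 or 4 are.
print([n for n in range(1, 9) if is_power_of_two(n)])  # [1, 2, 4, 8]
```

If the CUDA kernel really does require a power-of-two `ncols`, a 2x7B or 4x7B merge of the same experts might load where this one asserts, but that is speculation.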
Link here: https://huggingface.co/Alsebay/HyouKan-GGUF

# Is this model good? Want more discussion? Let me know in the community tab! ヾ(≧▽≦*)o