GGUF
English
Not-For-All-Audiences
conversational
Artefact2 commited on
Commit
dc5d62b
·
verified ·
1 Parent(s): dc5ab1d

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md ADDED
@@ -0,0 +1,15 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: cc-by-nc-4.0
5
+ tags:
6
+ - not-for-all-audiences
7
+ ---
8
+
9
+ These are GGUF quantized versions of [Envoid/Cat-8x7B](https://huggingface.co/Envoid/Cat-8x7B).
10
+
11
+ The importance matrix was trained for 100K tokens (200 batches of 512 tokens) using `wiki.train.raw`.
12
+
13
+ The IQ2_XXS and IQ2_XS versions are compatible with llama.cpp, version `147b17a` or later. The IQ3_XXS requires version `f4d7e54` or later.
14
+
15
+ Some model files above 50GB are split into smaller files. To concatenate them, use the `cat` command (on Windows, use PowerShell): `cat foo-Q6_K.gguf.* > foo-Q6_K.gguf`