dranger003
commited on
Commit
•
1aaa8cd
1
Parent(s):
5e54cb4
Update README.md
Browse files
README.md
CHANGED
@@ -7,7 +7,7 @@ base_model: CohereForAI/c4ai-command-r-plus
|
|
7 |
**2024-04-06**: Support for this model is still being worked on - [`PR #6491`](https://github.com/ggerganov/llama.cpp/pull/6491).
|
8 |
I am currently re-uploading all the quants compatible with the PR.
|
9 |
|
10 |
-
* What
|
11 |
* How do I use imatrix quants? Just like any other GGUF, the `.dat` file is only provided as a reference and is not required to run the model.
|
12 |
* GGUF importance matrix (imatrix) quants for https://huggingface.co/CohereForAI/c4ai-command-r-plus
|
13 |
* The importance matrix is trained for ~100K tokens (200 batches of 512 tokens) using [wiki.train.raw](https://huggingface.co/datasets/wikitext).
|
|
|
7 |
**2024-04-06**: Support for this model is still being worked on - [`PR #6491`](https://github.com/ggerganov/llama.cpp/pull/6491).
|
8 |
I am currently re-uploading all the quants compatible with the PR.
|
9 |
|
10 |
+
* What is importance matrix (imatrix)? You can [read more about it from the author here](https://github.com/ggerganov/llama.cpp/pull/4861).
|
11 |
* How do I use imatrix quants? Just like any other GGUF, the `.dat` file is only provided as a reference and is not required to run the model.
|
12 |
* GGUF importance matrix (imatrix) quants for https://huggingface.co/CohereForAI/c4ai-command-r-plus
|
13 |
* The importance matrix is trained for ~100K tokens (200 batches of 512 tokens) using [wiki.train.raw](https://huggingface.co/datasets/wikitext).
|