Commit · abb4a54
1 Parent(s): d70433a
update readme
README.md CHANGED
@@ -228,29 +228,15 @@ model-index:
 # ibm-granite/granite-20b-code-base-Q4_K_M-GGUF
 This model was converted to GGUF format from [`ibm-granite/granite-20b-code-base`](https://huggingface.co/ibm-granite/granite-20b-code-base).
 Refer to the [original model card](https://huggingface.co/ibm-granite/granite-20b-code-base) for more details on the model.
-## Use with llama.cpp
-
-Install llama.cpp through brew.
-
-```bash
-brew install ggerganov/ggerganov/llama.cpp
-```
-Invoke the llama.cpp server or the CLI.
-
-CLI:
-
-
-
-
-
-Server:
-
-```bash
-llama-server --hf-repo ibm-granite/granite-20b-code-base-Q4_K_M-GGUF --model granite-20b-code-base.Q4_K_M.gguf -c 2048
-```
-
-
-
-
-
+
+## Use with llama.cpp
+```shell
+git clone https://github.com/ggerganov/llama.cpp
+cd llama.cpp
+
+# install
+make
+
+# run generation
+./main -m granite-20b-code-base-Q4_K_M-GGUF/granite-20b-code-base.Q4_K_M.gguf -n 128 -p "def generate_random(x: int):" --color
+```
 ```
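The added instructions point `./main` at a local path (`granite-20b-code-base-Q4_K_M-GGUF/granite-20b-code-base.Q4_K_M.gguf`) but do not show how the file gets there. Below is a minimal sketch of one way to fetch it, assuming the `huggingface_hub` CLI is installed; the repo id and file name are taken from the README above, everything else is an assumption rather than part of this commit:

```shell
# Assumption: huggingface-cli is available via `pip install huggingface_hub`.
# Repo id and file name come from the README; the local directory name is arbitrary.
huggingface-cli download ibm-granite/granite-20b-code-base-Q4_K_M-GGUF \
  granite-20b-code-base.Q4_K_M.gguf \
  --local-dir granite-20b-code-base-Q4_K_M-GGUF

# Then build llama.cpp and run generation as shown in the added section:
./main -m granite-20b-code-base-Q4_K_M-GGUF/granite-20b-code-base.Q4_K_M.gguf -n 128 -p "def generate_random(x: int):" --color
```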
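For reference, the removed section started `llama-server` with a 2048-token context. Once such a server is running, it can be queried over HTTP. A rough sketch, assuming the server listens on llama.cpp's default `localhost:8080` and exposes its `/completion` endpoint (defaults not stated in the README):

```shell
# Assumption: llama-server was started as in the removed section and is
# listening on its default address, localhost:8080.
curl -s http://localhost:8080/completion \
  -H "Content-Type: application/json" \
  -d '{"prompt": "def generate_random(x: int):", "n_predict": 64}'
```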