keitokei1994
committed
Commit c45ee6a
1 Parent(s): 7f20af8
Update README.md
README.md CHANGED
@@ -19,7 +19,7 @@ language:
 ```
 2. Run it with FlashAttention enabled using a command like the following:
 ```
-./server -m ./models/shisa-v1-qwen2-7b.Q8_0.gguf -ngl 99 --port 8888 -fa
+./llama-server -m ./models/shisa-v1-qwen2-7b.Q8_0.gguf -ngl 99 --port 8888 -fa
 ```
 
 # shisa-v1-qwen2-7b-gguf
@@ -35,5 +35,5 @@ This is a gguf format conversion of [shisa-v1-qwen2-7b](https://huggingface.co/s
 ```
 2. Run with Flash Attention enabled using a command like this:
 ```
-./server -m ./models/shisa-v1-qwen2-7b.Q8_0.gguf -ngl 99 --port 8888 -fa
+./llama-server -m ./models/shisa-v1-qwen2-7b.Q8_0.gguf -ngl 99 --port 8888 -fa
 ```
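For context, the changed command reflects llama.cpp's rename of its example binaries to the `llama-` prefix (`server` → `llama-server`); the flags offload all layers to the GPU (`-ngl 99`), enable FlashAttention (`-fa`), and serve on port 8888. Below is a minimal sketch of exercising the running server, assuming the build exposes llama.cpp's standard `/completion` HTTP endpoint; the prompt text and `n_predict` value are placeholders:

```
# Sketch: query the llama-server instance started by the command above.
# Assumes the default /completion endpoint on port 8888; adjust as needed.
curl http://localhost:8888/completion \
  -H "Content-Type: application/json" \
  -d '{
        "prompt": "Hello, please introduce yourself.",
        "n_predict": 128
      }'
```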