apepkuss79
committed on
Update README.md
README.md
CHANGED
@@ -30,9 +30,7 @@ tags:
 
 ## Run with LlamaEdge
 
-
-
-- LlamaEdge version: coming soon
+- LlamaEdge version: [v0.11.2](https://github.com/LlamaEdge/LlamaEdge/releases/tag/0.11.2) and above
 
 - Prompt template
 
@@ -54,7 +52,7 @@ tags:
 
 - Context size: `4000`
 
-
+- Run as LlamaEdge service
 
 ```bash
 wasmedge --dir .:. --nn-preload default:GGML:AUTO:Phi-3-medium-4k-instruct-Q5_K_M.gguf \
@@ -72,7 +70,7 @@ tags:
 --prompt-template phi-3-chat \
 --ctx-size 4000
 ```
-
+
 ## Quantized GGUF Models
 
 | Name | Quant method | Bits | Size | Use case |