second-state
/

Phi-3-medium-4k-instruct-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

apepkuss79 commited on May 26, 2024

Commit

d9c38c1

·

verified ·

1 Parent(s): 7072b66

Update README.md

Files changed (1) hide show

README.md +3 -5

README.md CHANGED Viewed

@@ -30,9 +30,7 @@ tags:
 ## Run with LlamaEdge
-<!-- - LlamaEdge version: [v0.10.2](https://github.com/LlamaEdge/LlamaEdge/releases/tag/0.10.2) and above -->
-- LlamaEdge version: coming soon
 - Prompt template
@@ -54,7 +52,7 @@ tags:
 - Context size: `4000`
-<!-- - Run as LlamaEdge service
   ```bash
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:Phi-3-medium-4k-instruct-Q5_K_M.gguf \
@@ -72,7 +70,7 @@ tags:
     --prompt-template phi-3-chat \
     --ctx-size 4000
   ```
- -->
 ## Quantized GGUF Models
 | Name | Quant method | Bits | Size | Use case |

 ## Run with LlamaEdge
+- LlamaEdge version: [v0.11.2](https://github.com/LlamaEdge/LlamaEdge/releases/tag/0.11.2) and above
 - Prompt template
 - Context size: `4000`
+- Run as LlamaEdge service
   ```bash
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:Phi-3-medium-4k-instruct-Q5_K_M.gguf \
     --prompt-template phi-3-chat \
     --ctx-size 4000
   ```
 ## Quantized GGUF Models
 | Name | Quant method | Bits | Size | Use case |