Melon commited on
Commit
dc5cc43
1 Parent(s): c821fbc

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -0
README.md ADDED
@@ -0,0 +1,11 @@
 
 
 
 
 
 
 
 
 
 
 
 
1
+ You can deploy it using vllm. And this is the script for deploying.
2
+ ``` bash
3
+ python -O -u -m vllm.entrypoints.openai.api_server \
4
+ --host=127.0.0.1 \
5
+ --port=8090 \
6
+ --model=Melon/Meta-Llama-3-70B-Instruct-AutoAWQ-4bit \
7
+ --tokenizer=meta-llama/Meta-Llama-3-70B-Instruct \
8
+ --tensor-parallel-size=1 \
9
+ --quantization awq \
10
+ --dtype half
11
+ ```