Update README.md Add vLLM deployment info.
Browse files
README.md
CHANGED
@@ -231,7 +231,17 @@ generated_ids = [
|
|
231 |
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
232 |
```
|
233 |
|
234 |
-
## 5.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
235 |
|
236 |
```
|
237 |
@misc{minimax2025minimax01scalingfoundationmodels,
|
@@ -245,8 +255,8 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
|
245 |
}
|
246 |
```
|
247 |
|
248 |
-
##
|
249 |
For general use and evaluation, we provide a [Chatbot](https://chat.minimax.io/) with online search capabilities and the [online API](https://www.minimax.io/platform) for developers. For general use and evaluation, we provide the [MiniMax MCP Server](https://github.com/MiniMax-AI/MiniMax-MCP) with video generation, image generation, speech synthesis, and voice cloning for developers.
|
250 |
|
251 |
-
##
|
252 |
Contact us at [[email protected]](mailto:[email protected]).
|
|
|
231 |
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
232 |
```
|
233 |
|
234 |
+
## 5. Deployment Guide
|
235 |
+
For production deployment, we recommend using [vLLM](https://docs.vllm.ai/en/latest/) to serve MiniMax-Text-01. vLLM provides excellent performance for serving large language models with the following features:
|
236 |
+
|
237 |
+
🔥 Outstanding service throughput performance
|
238 |
+
⚡ Efficient and intelligent memory management
|
239 |
+
📦 Powerful batch request processing capability
|
240 |
+
⚙️ Deeply optimized underlying performance
|
241 |
+
|
242 |
+
For detailed deployment instructions, please refer to our [vLLM Deployment Guide](https://github.com/MiniMax-AI/MiniMax-01/blob/main/docs/vllm_deployment_guild.md).
|
243 |
+
|
244 |
+
## 6. Citation
|
245 |
|
246 |
```
|
247 |
@misc{minimax2025minimax01scalingfoundationmodels,
|
|
|
255 |
}
|
256 |
```
|
257 |
|
258 |
+
## 7. Chatbot & API
|
259 |
For general use and evaluation, we provide a [Chatbot](https://chat.minimax.io/) with online search capabilities and the [online API](https://www.minimax.io/platform) for developers. For general use and evaluation, we provide the [MiniMax MCP Server](https://github.com/MiniMax-AI/MiniMax-MCP) with video generation, image generation, speech synthesis, and voice cloning for developers.
|
260 |
|
261 |
+
## 8. Contact Us
|
262 |
Contact us at [[email protected]](mailto:[email protected]).
|