Update README.md Add vLLM deployment info.

#31
by QscQ - opened
Files changed (1) hide show
  1. README.md +13 -3
README.md CHANGED
@@ -231,7 +231,17 @@ generated_ids = [
231
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
232
  ```
233
 
234
- ## 5. Citation
 
 
 
 
 
 
 
 
 
 
235
 
236
  ```
237
  @misc{minimax2025minimax01scalingfoundationmodels,
@@ -245,8 +255,8 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
245
  }
246
  ```
247
 
248
- ## 6. Chatbot & API
249
  For general use and evaluation, we provide a [Chatbot](https://chat.minimax.io/) with online search capabilities and the [online API](https://www.minimax.io/platform) for developers. For general use and evaluation, we provide the [MiniMax MCP Server](https://github.com/MiniMax-AI/MiniMax-MCP) with video generation, image generation, speech synthesis, and voice cloning for developers.
250
 
251
- ## 7. Contact Us
252
  Contact us at [[email protected]](mailto:[email protected]).
 
231
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
232
  ```
233
 
234
+ ## 5. Deployment Guide
235
+ For production deployment, we recommend using [vLLM](https://docs.vllm.ai/en/latest/) to serve MiniMax-Text-01. vLLM provides excellent performance for serving large language models with the following features:
236
+
237
+ 🔥 Outstanding service throughput performance
238
+ ⚡ Efficient and intelligent memory management
239
+ 📦 Powerful batch request processing capability
240
+ ⚙️ Deeply optimized underlying performance
241
+
242
+ For detailed deployment instructions, please refer to our [vLLM Deployment Guide](https://github.com/MiniMax-AI/MiniMax-01/blob/main/docs/vllm_deployment_guild.md).
243
+
244
+ ## 6. Citation
245
 
246
  ```
247
  @misc{minimax2025minimax01scalingfoundationmodels,
 
255
  }
256
  ```
257
 
258
+ ## 7. Chatbot & API
259
  For general use and evaluation, we provide a [Chatbot](https://chat.minimax.io/) with online search capabilities and the [online API](https://www.minimax.io/platform) for developers. For general use and evaluation, we provide the [MiniMax MCP Server](https://github.com/MiniMax-AI/MiniMax-MCP) with video generation, image generation, speech synthesis, and voice cloning for developers.
260
 
261
+ ## 8. Contact Us
262
  Contact us at [[email protected]](mailto:[email protected]).