Update README.md
#2
by zwpride-iquestlab - opened
README.md
CHANGED
@@ -137,6 +137,13 @@ outputs = model.generate(
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ```
 
+### Deployment with vLLM
+
+For production deployment, you can use vLLM to create an OpenAI-compatible API endpoint.
+
+```
+vllm serve Multilingual-Multimodal-NLP/IndustrialCoder --tensor-parallel-size 8
+```
+
 ### Fill-in-the-Middle (FIM)
 
 InCoder-32B supports FIM completion for code infilling tasks:
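Once the server started by the `vllm serve` command above is running, it exposes an OpenAI-compatible chat-completions route. A minimal sketch of the request body a client would POST to it; the endpoint path `/v1/chat/completions`, port 8000, and the example prompt are assumptions based on vLLM defaults, not part of this diff:

```python
import json

# Request body for vLLM's OpenAI-compatible chat-completions route.
# The "model" field must match the model name passed to `vllm serve`.
payload = {
    "model": "Multilingual-Multimodal-NLP/IndustrialCoder",
    "messages": [
        {"role": "user", "content": "Write a Python function that reverses a string."}
    ],
    "max_tokens": 256,
    "temperature": 0.2,
}

# Serialize to JSON; this is what a client would POST to
# http://localhost:8000/v1/chat/completions (assumed default host/port).
body = json.dumps(payload)
print(body)
```

The same request can be issued with any OpenAI-compatible client by pointing its base URL at the vLLM server instead of the OpenAI API.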