feat: support LMDeploy backend #966

zhyncs · 2024-04-25T00:53:57Z

Feature request

@aarnphm @ssheng @parano Hi OpenLLM team, thank you for your exceptional work. OpenLLM currently supports two backends, vLLM and PyTorch. Usability is good, but there is still room for improvement in performance. LMDeploy strikes a good balance between performance and usability: recently, Llama3 8B showed a 1.8x performance improvement on LMDeploy over vLLM. Performance is crucial, especially once user requirements are met and the demand for large-scale deployment arises. Meituan currently uses LMDeploy widely internally. I strongly recommend that OpenLLM consider integrating LMDeploy and making it the default backend. You can refer to the documentation at https://lmdeploy.readthedocs.io/en/latest/ during research and integration. Thanks.

Motivation

No response

Other

No response

zhyncs · 2024-04-25T02:48:56Z

cc @lvhan028 @AllentDan

