Feature request
@aarnphm @ssheng @parano Hi OpenLLM team, thank you for your exceptional work. OpenLLM currently supports two backends, vLLM and PyTorch. Usability is good, but there is still room for improvement in performance. LMDeploy strikes a good balance between performance and usability: recently, Llama3 8B showed a 1.8x performance improvement on LMDeploy compared with vLLM. Performance is crucial, especially once user requirements are met and demand for large-scale deployment arises. Meituan is currently using it widely internally. I strongly recommend that OpenLLM consider integrating LMDeploy and making it the default backend. You can refer to the documentation at https://lmdeploy.readthedocs.io/en/latest/ during research and integration. Thanks.
Motivation
No response
Other
No response