Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chatglm-cpp推理加速比transformer推理慢很多 #225

Open
luhairong11 opened this issue Apr 8, 2024 · 0 comments
Open

chatglm-cpp推理加速比transformer推理慢很多 #225

luhairong11 opened this issue Apr 8, 2024 · 0 comments

Comments

@luhairong11
Copy link

在使用chatglm-cpp推理加速时,比transformer推理慢很多,有人遇到过这个问题吗,采用的模型是codegeex2-6b-int4

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant