
GPU version build not using GPU #114

dspasyuk opened this issue Aug 6, 2023 · 11 comments

@dspasyuk commented Aug 6, 2023

Hi Everyone,

I am trying to build llama-node for GPU. I followed the guide at https://llama-node.vercel.app/docs/cuda, but the version of llama.cpp I get from a manual build uses the CPU, not the GPU. When I build llama.cpp directly in the llama-sys folder using the following command:

make clean && LLAMA_CUBLAS=1 make -j
it produces a perfectly fine GPU-enabled executable that works with no problem.

Am I missing something?
Here are my full build commands:

git clone https://github.com/Atome-FE/llama-node.git
cd llama-node/
rustup target add x86_64-unknown-linux-musl
git submodule update --init --recursive
pnpm install --ignore-scripts
cd packages/llama-cpp/
pnpm build:cuda

Then I get a libllama.so file in ~/.llama-node which, when used, does not use the GPU. Here is my script to run it:

import { LLM } from "llama-node";
import { LLamaCpp } from "llama-node/dist/llm/llama-cpp.js";
import path from "path";
import os from "os";

// "~" is not expanded by path.resolve, so resolve against the home directory instead.
const model = path.resolve(os.homedir(), "CODE/models/vicuna-7b-v1.3.ggmlv3.q4_0.bin");
const llama = new LLM(LLamaCpp);

const config = {
  modelPath: model,
  enableLogging: true,
  nCtx: 1024,
  seed: 0,
  f16Kv: false,
  logitsAll: false,
  vocabOnly: false,
  useMlock: false,
  embedding: false,
  useMmap: true,
  nGpuLayers: 40, // layers to offload to the GPU
};

const template = "How do I train you to read my documents?";
const prompt = `A chat between a user and an assistant. USER: ${template} ASSISTANT:`;

const params = {
  nThreads: 4,
  nTokPredict: 2048,
  topK: 40,
  topP: 0.1,
  temp: 0.2,
  repeatPenalty: 1,
  prompt,
};

const run = async () => {
  await llama.load(config);
  await llama.createCompletion(params, (response) => {
    process.stdout.write(response.token);
  });
};

run();
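
For reference, one quick way to check whether the libllama.so in ~/.llama-node was actually built with CUDA support is to inspect its linked libraries. The snippet below is a minimal sketch, assuming a Linux system where ldd is on the PATH and the cuBLAS build links the CUDA runtime dynamically; a CPU-only build should show no CUDA libraries.

// Sketch: inspect the rebuilt libllama.so with ldd. A cuBLAS build normally
// links against libcublas/libcudart; a CPU-only build does not.
// Assumes Linux, ldd available, and dynamic linking of the CUDA libraries.
import { execFileSync } from "node:child_process";
import os from "node:os";
import path from "node:path";

const lib = path.join(os.homedir(), ".llama-node", "libllama.so");
const deps = execFileSync("ldd", [lib], { encoding: "utf8" });
console.log(/cublas|cudart/i.test(deps)
  ? "CUDA libraries linked - GPU build"
  : "no CUDA libraries linked - CPU-only build");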

Any help appreciated

@shaileshminsnapsys

I am facing the same issue. Can anyone please guide us on this?

@dspasyuk commented Aug 18, 2023

I ended up using llama.cpp directly. It works very well on the GPU. You can write a simple wrapper in Node.js without Rust. I can share the code if you want.
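
To sketch what such a wrapper can look like: it mostly comes down to spawning a llama.cpp binary built with LLAMA_CUBLAS=1 and streaming its stdout. The example below is a minimal illustration, not the actual llcui code; the binary path, model path, and flags are placeholders and may differ between llama.cpp versions.

// Minimal sketch of a Node.js wrapper around a llama.cpp binary built with
// LLAMA_CUBLAS=1. Paths and flags below are placeholders; adjust for your build.
import { spawn } from "node:child_process";

const LLAMA_BIN = "./llama.cpp/main"; // path to the compiled llama.cpp binary
const MODEL = "./models/vicuna-7b-v1.3.ggmlv3.q4_0.bin";

function complete(prompt, onToken) {
  return new Promise((resolve, reject) => {
    const proc = spawn(LLAMA_BIN, [
      "-m", MODEL,
      "-p", prompt,
      "-n", "2048",           // max tokens to predict
      "--n-gpu-layers", "40", // layers to offload to the GPU
    ]);
    proc.stdout.on("data", (chunk) => onToken(chunk.toString())); // generated text
    proc.stderr.on("data", () => {});                             // llama.cpp logs go to stderr
    proc.on("error", reject);
    proc.on("close", (code) => (code === 0 ? resolve() : reject(new Error(`exit ${code}`))));
  });
}

complete("A chat between a user and an assistant. USER: Hello ASSISTANT:",
  (t) => process.stdout.write(t));

GPU offloading is then handled entirely by llama.cpp itself; the Node.js side only forwards the prompt and streams the output.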

@shaileshminsnapsys

@deonis1 It would be a great help. Please share the code.

@dspasyuk

@shaileshminsnapsys No problem, the code is here: https://github.com/deonis1/llcui

@shaileshminsnapsys

Thank you @deonis1, I'll check out the code.

Thank you for your help.

@dspasyuk

Let me know if you have any issues.

@shaileshminsnapsys commented Aug 22, 2023

@deonis1 Thank you so much, your code helped me a lot to achieve my goal.

Many thanks!

@dspasyuk

@shaileshminsnapsys No problem, there is a new version if you are interested.

@shaileshminsnapsys

@deonis1 I would love to see the new version. Thank you!

@dspasyuk

@shaileshminsnapsys The new version, which supports embeddings (MongoDB or text documents), has been released. You can find it at the new URL:
https://github.com/deonis1/llama.cui

@shaileshminsnapsys

@deonis1 Wow, that's amazing. Thanks, I'll definitely give it a try.
