Is inference possible with just a 10GB RTX 3080? #25

Open
davidmartinrius opened this issue Jun 21, 2023 · 3 comments

Comments

@davidmartinrius

Hello,

I know it is very little memory, but it is what I have for now.

By default, the demo code fails with a CUDA out-of-memory error during inference. I tried reducing the inference batch size to just 1, but that is not enough.

Do you know of a way to reduce memory consumption when running inference?

I know the best solution is to upgrade to an RTX 3090/4090/A6000, but before doing that I would like to try another approach if possible.

Thank you!

David Martin Rius

@deepanwayx
Collaborator

The required VRAM is around 13 GB for full-precision inference with a batch size of 1.

You can also try Colaboratory for inference: #10
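
For reference, a quick, generic PyTorch check (not part of this repository's code) reports how much VRAM the card actually exposes, to compare against the ~13 GB requirement; device index 0 is an assumption:

```python
import torch

# Generic check of the locally available VRAM, to compare against the
# ~13 GB needed for full-precision inference. Device index 0 is an
# assumption; adjust it if you have several GPUs.
if torch.cuda.is_available():
    props = torch.cuda.get_device_properties(0)
    print(f"{props.name}: {props.total_memory / 1024**3:.1f} GB total VRAM")
else:
    print("No CUDA device visible")
```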

@illtellyoulater

@deepanwayx I suppose full inference precision means 32-bit, correct? If so, did you run any tests to check whether 16-bit would still deliver acceptable results?

@deepanwayx
Collaborator

Yes, the full inference precision is 32-bit. We did not test with 16-bit inference.
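
For anyone who wants to experiment with 16-bit inference anyway, here is a minimal, untested sketch in plain PyTorch; `DummyModel` is a placeholder rather than this repository's model, so behavior with the real checkpoints is unverified:

```python
import torch
import torch.nn as nn

# Placeholder module standing in for the real model; loading the actual
# checkpoint is outside the scope of this sketch.
class DummyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Linear(512, 512)

    def forward(self, x):
        return self.net(x)

device = "cuda" if torch.cuda.is_available() else "cpu"
# Half precision roughly halves the weight memory; fall back to float32
# on CPU, where float16 kernels are not always available.
dtype = torch.float16 if device == "cuda" else torch.float32

model = DummyModel().to(device=device, dtype=dtype)

with torch.inference_mode():
    x = torch.randn(1, 512, device=device, dtype=dtype)
    y = model(x)  # forward pass runs in the reduced precision

print(y.dtype)  # torch.float16 on a CUDA device
```

Casting the real model the same way would also require its inputs and intermediate tensors to be float16, and whether output quality survives the precision drop is untested, as noted above.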
