GPUs #2

fakerybakery · 2023-12-09T01:45:50Z

Hi,
Great repo! You mentioned you need quite a few A100s. If this model is ~50B parameters and people can run Llama 2 70B on a single A100, why does this take so much compute?
Thank you!

vikhyat · 2023-12-09T02:06:09Z

I've never tried Llama 70B, but this is running in fp16 without any quantization. That might be part of it?
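
A rough back-of-the-envelope sketch of why precision matters here. It counts weight memory only (no activations, KV cache, or framework overhead). It takes the ~50B figure from the question above and assumes that the single-A100 Llama 2 70B setups use 4-bit quantized weights, which is an assumption and not something confirmed in this thread. The helper name `weight_memory_gb` is made up for illustration.

```python
# Weight-only memory estimate; real usage is higher because of
# activations, KV cache, and framework overhead.

A100_MEMORY_GB = 80  # largest A100 variant (a 40 GB variant also exists)


def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for the model weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# ~50B parameters in fp16 (16 bits per weight), as described in this thread.
fp16_50b = weight_memory_gb(50e9, 16)

# Llama 2 70B in fp16, for comparison.
fp16_70b = weight_memory_gb(70e9, 16)

# Llama 2 70B with 4-bit weights (assumed quantization level, see note above).
int4_70b = weight_memory_gb(70e9, 4)

for name, gb in [
    ("50B fp16", fp16_50b),
    ("70B fp16", fp16_70b),
    ("70B 4-bit", int4_70b),
]:
    verdict = "fits on" if gb <= A100_MEMORY_GB else "does not fit on"
    print(f"{name}: ~{gb:.0f} GB of weights, {verdict} one {A100_MEMORY_GB} GB A100")
```

Under these assumptions, the fp16 weights of a ~50B model already exceed one 80 GB A100, while a 4-bit 70B model fits with room to spare, so quantization alone could plausibly account for much of the gap.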
