Can you run DeepSeek R1 on a AMD 7900 XTX 24GB GPU?

Zeon@lemmy.world · edit-2 3 days ago

Can you run DeepSeek R1 on a AMD 7900 XTX 24GB GPU?

Domi@lemmy.secnd.me · 2 days ago

I run the 32b one on my 7900 XTX in Alpaca https://jeffser.com/alpaca/

There is no way to fit the full model in any single AMD or Nvidia GPU in existence.

Eager Eagle@lemmy.world · 2 days ago

check this out

https://apxml.com/posts/gpu-requirements-deepseek-r1

The Hobbyist@lemmy.zip · 3 days ago

To run the full 671B sized model (404GB in size), you would need more than 404GB of combined GPU memory and standard memory (and that’s only to run it, you would most probably want it all to be GPU memory to make it run fast).

With 24GB of GPU memory, the largest model which would fit from the R1 series would be the 32b-qwen-distill-q4_K_M (20GB in size) available at ollama (and possibly elsewhere).

wuphysics87@lemmy.ml · 2 days ago

I run it on a 6700xt

Fisch@discuss.tchncs.de · 3 days ago

I don’t know how big the original model is but I have an RX 6700 XT and I can easily run the Llama 3 8B distill of Deepseek R1 with 32k context. I just haven’t figured out how to get good results yet, it always does the <thinking><thinking/> thing.

Here_for_the_dudes@sh.itjust.works · 3 days ago

I run the 32b Version on my 6700xt with an R9 3700x using ollama. It runs well but it gets a bit slower on complex problems. I once ran an 70b Llama model, but it took a long time to finish.

Eager Eagle@lemmy.world · edit-2 3 days ago

They run smaller variations of it in their personal machines. There are models that fit in almost any machine, but IME the first model that is useful is the 32b, which you can probably run on the XTX. Anything less than that, only for the more trivial tasks.