Hey everyone! I was just skimming through some of other people's inference benchmarks and noticed that the driver version is usually mentioned. It made me wonder how relevant it is. My prod server runs Debian 12, so the packaged nvidia drivers are rather old, but I'd prefer not to mess with the drivers if it won't bring a benefit. Do any of you have experience with this, or have you done some testing?

  • tal@lemmy.today
    19 days ago

    On AMD hardware, I moved from ROCm 6 to 7 (and the associated amdgpu driver release) and saw pretty noticeable inference performance improvements on an RX 7900 XTX with llama.cpp. As of ROCm 7.0.2, AMD has a Debian Trixie build, BTW.

    But I imagine that AMD/Nvidia, the specific hardware, the application, and the settings are all major inputs into that. YMMV.

    EDIT: Not to mention the model being run.
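
    If you want to do a similar before/after comparison yourself, the simplest approach is to run the same workload (same model, same settings) on each driver, record tokens/sec from a few runs, and compare medians. A minimal sketch of the arithmetic, with placeholder numbers (not my measurements):

    ```python
    from statistics import median

    def pct_change(before: float, after: float) -> float:
        """Percent change in throughput from `before` to `after` tokens/sec."""
        return (after - before) / before * 100.0

    # Hypothetical tokens/sec from repeated runs on each driver version.
    # Take the median of several runs to reduce run-to-run noise.
    old_driver_runs = [91.8, 92.4, 92.1]   # placeholder values
    new_driver_runs = [101.2, 101.8, 101.5]  # placeholder values

    before_ts = median(old_driver_runs)
    after_ts = median(new_driver_runs)

    print(f"throughput change: {pct_change(before_ts, after_ts):+.1f}%")
    ```

    With llama.cpp specifically, llama-bench is the usual way to get those tokens/sec numbers in a repeatable way.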