Comment on Consumer GPUs to run LLMs

umami_wasbi@lemmy.ml ⁨3⁩ ⁨days⁩ ago

Using 7900XTX with LMS. Speed are everwhere, driver dependent. With QwQ-32B-Q4_K_M, I got about 20 tok/s, with all VRAM filled.

source
Sort:hotnewtop