Comment on Edward Snowden slams Nvidia's RTX 50-series 'F-tier value,' whistleblows on lackluster VRAM capacity

TheHobbyist@lemmy.zip 3 days ago

Ollama, latest version. I have it set up with Open-WebUI (though that shouldn’t matter). The 14B model is around 9GB, which easily fits in the 12GB of VRAM.

I’m repeating the 28 t/s from memory, but even if I’m wrong it’s easily above 20.
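To check the figure rather than rely on memory, here is a minimal sketch. It assumes Ollama's HTTP API on its default local port and a non-streaming `/api/generate` call, whose response reports `eval_count` (tokens generated) and `eval_duration` (in nanoseconds). The function names and the prompt are made up for illustration.

```python
import json
import urllib.request

# Assumption: Ollama is listening on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def tokens_per_second(response: dict) -> float:
    """Generation speed from a non-streaming /api/generate response.

    eval_count is the number of generated tokens and eval_duration is
    the generation time in nanoseconds.
    """
    return response["eval_count"] / (response["eval_duration"] / 1e9)


def measure(model: str, prompt: str) -> float:
    """Hypothetical helper: run one prompt and return the measured t/s."""
    payload = json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as reply:
        return tokens_per_second(json.load(reply))


if __name__ == "__main__":
    # Needs a running Ollama instance with the model already pulled.
    speed = measure(
        "deepseek-r1:14b-qwen-distill-q4_K_M",
        "Explain VRAM in one paragraph.",
    )
    print(f"{speed:.1f} t/s")
```

A single short prompt gives a noisy number, so it is worth averaging a few runs before comparing cards.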

Specifically, I’m running this model: ollama.com/…/deepseek-r1:14b-qwen-distill-q4_K_M
