Comment on Edward Snowden slams Nvidia's RTX 50-series 'F-tier value,' whistleblows on lackluster VRAM capacity

<- View Parent
Viri4thus@feddit.org ⁨2⁩ ⁨days⁩ ago

Ty. I’ll try ollama with the Q-4-M quantization. I wouldn’t expect to see a difference between ollama and SGlang.

source
Sort:hotnewtop