The key is which one, and how, though.
For the really sparse models, you might be better off trying ik_llama.cpp, especially if you are targeting a ‘small’ quant.
You can use Vulkan fairly easily as long as you have 8 GB of VRAM.
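For reference, a minimal sketch of building llama.cpp with the Vulkan backend and partially offloading layers to fit a limited VRAM budget (the model path and layer count below are placeholders, not specific recommendations):

```shell
# Build llama.cpp with the Vulkan backend enabled
cmake -B build -DGGML_VULKAN=ON
cmake --build build --config Release

# Run with partial GPU offload; lower -ngl until the model fits in VRAM
# (model path and layer count are placeholders)
./build/bin/llama-cli -m ./models/model.gguf -ngl 16 -p "Hello"
```

With `-ngl` you only push as many layers to the GPU as your VRAM allows; the rest stay on the CPU, so this works even on small cards.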
Passerby6497@lemmy.world 1 day ago
Only got 4 GB of VRAM, unfortunately.