Comment on [deleted]

Naz@sh.itjust.works 5 days ago

If you are running on CPU only, you need to look at very small models or 2-bit quants.

Anything bigger will be extremely slow.
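
A rough back-of-envelope sketch of why size matters so much on CPU, assuming token generation is limited mostly by how fast the weights can be read from RAM. The function names, the 7B parameter count, and the 50 GB/s bandwidth figure are all illustrative assumptions, not measurements, and real 2-bit quants usually average somewhat more than 2 bits per weight.

```python
def model_size_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes).

    Ignores the KV cache, activations, and file-format overhead.
    """
    return params_billions * bits_per_weight / 8


def est_tokens_per_sec(size_gb: float, mem_bandwidth_gbs: float) -> float:
    """Rough upper bound on generation speed.

    Assumes every generated token requires reading all weights once,
    so speed is capped at bandwidth divided by model size.
    """
    return mem_bandwidth_gbs / size_gb


if __name__ == "__main__":
    bandwidth = 50.0  # GB/s, assumed figure for a desktop with dual-channel RAM
    for bits in (16, 8, 4, 2):
        size = model_size_gb(7, bits)  # hypothetical 7B-parameter model
        speed = est_tokens_per_sec(size, bandwidth)
        print(f"{bits:>2}-bit: ~{size:.2f} GB, at most ~{speed:.1f} tokens/s")
```

Under these assumptions, halving the bits per weight roughly doubles the speed ceiling, which is why dropping to a smaller model or a lower-bit quant makes such a difference without a GPU.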
