Comment on Consumer hardware is no longer a priority for manufacturers

brucethemoose@lemmy.world 2 days ago

This is not true. I have a single 3090 + 128GB of system RAM (which wasn’t that expensive until recently), and I can run GLM 4.6 350B at 6 tokens/sec. I can run sparser models like Stepfun 3.5, GLM Air or Minimax 2.1 much faster, and these are all better than the cheapest API models.
