Exactly. The difference between a cached response and a live one even for non-AI queries is an OOM difference.
At this point, a lot of people just care about the ‘feel’ of anti-AI articles even if the substance is BS though.
And then people just feed whatever gets clicks and shares.
quick@thelemmy.club 4 months ago
Googles tpu can’t handle llm’s lol. What do you mean “exactly”?
kromem@lemmy.world 4 months ago
Did you think Google’s only TPUs are the ones in the Pixel phones, and didn’t know that they have server TPUs?