Nothin’ I’m running, that’s for sure!
It’s not really that there are services that require that much processing power for a single request; it’s that it’s designed to handle normal requests for hundreds or thousands of users at once.
I suppose that supporting 0.5TB of RAM means it could deal with quite a big LLM, but any sort of halfway-modern GPU would absolutely run circles around it in terms of tokens per second, on any model that fit in their VRAM.
MangoPenguin@lemmy.blahaj.zone 4 days ago
That N200 is likely on par or faster than dual Opteron 6272 CPUs, since they are so old.
spaghettiwestern@sh.itjust.works 4 days ago
A single Opteron 6272 is somewhat faster than the N200, but the Opteron’s TDP is 115 watts while the N200’s is only 6 watts. OP’s server with 2 processors is more than 2x as fast as my single processor laptop, but can require nearly 40x the electricity. For a home server it’s major overkill.
MangoPenguin@lemmy.blahaj.zone 4 days ago
Newer CPUs can also just be better optimized and have more faster cache and that sort of thing, so might be faster at running a process even if they’re the same on paper.