Comment on Oracle Layoffs: Tech giant to slash 30,000 jobs as banks pull out from financing AI data centres | Company Business News

<- View Parent
CileTheSane@lemmy.ca ⁨3⁩ ⁨days⁩ ago

I think you are underestimating how accurate LLMs are because you probably don’t use them much, and only see there mistakes posted for memes. No one’s going to post the 99 times an LLM gives the correct answer, but the one time it says to put glue on pizza it’s going to go viral. So if your only view on LLM output is from posts, you’re going to think it’s way worse than it is.

And look at what is on my feed just this morning: lemmy.world/post/44099386

It’s not just that LLMs are shit. It’s that people trust them way too much and are shocked when the predictable happens.

Even if you mark it down for incorrect answers it’s still going to beat most people. An LLM can score in the 90th percentile in the SAT, and around the 80th percentile in the LSAT.

And of course the AI bro goes for the “vibes” argument. You can’t just state that as true without providing a source. Or did AI tell you it was true?

For example: fewer than 10% of tested AIs consistently properly answered that you need to drive to a car wash in order to wash your car: opper.ai/blog/car-wash-test

That’s a question so far below anything on the SAT or LSAT and 90% of LLMs can’t even get that right.

If you’re doubting my percentages on the accuracy of LLMs I’d encourage you to test them yourself.

I’ve tried using LLMs. I don’t use them for research, because why the fuck would I? Better, more efficient tools already exist for that. When I had something that a search engine can’t help me with and LLMs are apparently “good at” it immediately proved itself to be worthless.

source
Sort:hotnewtop