Comment on AI agents wrong ~70% of time: Carnegie Mellon study

lepinkainen@lemmy.world 1 week ago

Wrong 70% doing what?

I’ve used LLMs as a Stack Overflow / MSDN replacement for over a year, and if they fucked up 7 out of 10 questions I’d have stopped.

Same with code: any free model can easily generate simple scripts and utilities with maybe a 10% error rate, definitely not 70%.
