Comment on AI agents wrong ~70% of time: Carnegie Mellon study

<- View Parent
MangoCats@feddit.it ⁨1⁩ ⁨week⁩ ago

I’ve been R&D forever, so at my level the question isn’t “does the code work?” we pretty much assume that will take care of itself, eventually. Our critical question is: “is the code trying to do something valuable, or not?” We make all kinds of stuff do what the requirements call for it to do, but so often those requirements are asking for worthless or even counterproductive things…

source
Sort:hotnewtop