Comment on AI agents wrong ~70% of time: Carnegie Mellon study

MangoCats@feddit.it 2 weeks ago

being able to do 30% of tasks successfully is already useful.

If you have a good testing program, it can be.

If you use AI to write the test cases…? I wouldn’t fly on that airplane.
