Comment on AI agents wrong ~70% of time: Carnegie Mellon study

loonsun@sh.itjust.works 1 week ago

It’s about agents, which implies multi-step work: agents are meant to execute a series of tasks, as opposed to studies that look at base LLM performance.
