Comment on [JS Required] How Performant are LLM Agents(AI Chatbots) on Real World Work Tasks? They Fail 70% or More of The Time.

unpossum@sh.itjust.works ⁨13⁩ ⁨hours⁩ ago

It’s easy to forget how fucking sci-fi the existence of these models is. I’m kind of excited to see where agent frameworks are in five years time, as well as a bit apprehensive…

source
Sort:hotnewtop