Comment on Announcing ARC-AGI-3 - A benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

tatterdemalion@programming.dev 1 week ago

Wdym? It’s existed for at least a decade. Plenty of papers about it. It mastered Atari and Mario. It became the best Go player.
