Comment on Announcing ARC-AGI-3 - A benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

lath@lemmy.world 13 hours ago

Biased study. Take any average person off the street and shove this thing in their face. That 100% figure will go down fast.
