Comment on Announcing ARC-AGI-3 - A benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

ExLisper@lemmy.curiana.net 5 weeks ago

Can’t wait for this to be the new captcha.
