Comment

Comment on Announcing ARC-AGI-3 - An benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

HaunchesTV@feddit.uk ⁨2⁩ ⁨months⁩ ago

Grok Reasoning: 0%

Hilarious

Sort:hotnew top

brsrklf@jlai.lu ⁨2⁩ ⁨months⁩ ago
Reasoning is woke propaganda, obviously.

source