Comment on Announcing ARC-AGI-3 - An benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

<- View Parent
lath@lemmy.world ⁨11⁩ ⁨hours⁩ ago

Stress test it. Low, average, high, impairment conditions, safeguards off, order, chaos and everything in between.

source
Sort:hotnewtop