Comment

Comment on Announcing ARC-AGI-3 - An benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

UnrepentantAlgebra@lemmy.world ⁨2⁩ ⁨months⁩ ago

If human scores were included, they would be at 100%, at the cost of approximately $250

Wait, why did it cost real humans $250 to pass the test?

Sort:hotnew top

mapleseedfall@lemmy.world ⁨2⁩ ⁨months⁩ ago
Youd have to eat $250 worth of burgers to pass it.

source
FrankFrankson@lemmy.world ⁨2⁩ ⁨months⁩ ago
Thatvis how much individual testing humans cost when you buy them in bulk.

source
ExLisper@lemmy.curiana.net ⁨2⁩ ⁨months⁩ ago
Because I ain’t doing this shit for free.

source