Comment on Announcing ARC-AGI-3 - An benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.
General_Effort@lemmy.world 4 days ago
ARC-AGI-3
What happened to ARC-AGI-1 and -2?