Comment

Comment on Announcing ARC-AGI-3 - An benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

tatterdemalion@programming.dev ⁨2⁩ ⁨months⁩ ago

LLMs might suck at this game but I’m pretty sure Deepmind’s deep reinforcement learning AI could solve these easily.

Sort:hotnew top

33550336@lemmy.world ⁨2⁩ ⁨months⁩ ago
if only it would exist

source
- tatterdemalion@programming.dev ⁨2⁩ ⁨months⁩ ago
  Wdym? It’s existed for at least a decade. Plenty of papers about it. It mastered Atari and Mario. It became the best Go player.
  
  source