Comment on ChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logic

nednobbins@lemm.ee 2 weeks ago

I imagine the “author” did something like, “Search scholar.google.com, find a publication where AI failed at something, and write a paragraph about it.”

It’s not even as bad as the article claims.

Atari isn’t great at chess. …stackexchange.com/…/how-strong-is-each-level-of-…
Random LLMs were nearly as good 2 years ago. lmsys.org/blog/2023-05-03-arena/
LLMs that are actually trained for chess have done much better. arxiv.org/abs/2501.17186
