Comment on ChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logic

IsaamoonKHGDT_6143@lemmy.zip 1 week ago

They used ChatGPT 4o instead of a reasoning model such as o1 or o3.

Obviously it was going to fail.


wizardbeard@lemmy.dbzer0.com 1 week ago
Other studies (not all of them chess-based or run against this old chess AI) show similarly lackluster results with reasoning models.
