IsaamoonKHGDT_6143@lemmy.zip 1 week ago
They used ChatGPT 4o, instead of using o1 or o3.
Obviously it was going to fail.
IsaamoonKHGDT_6143@lemmy.zip 1 week ago
They used ChatGPT 4o, instead of using o1 or o3.
Obviously it was going to fail.
wizardbeard@lemmy.dbzer0.com 1 week ago
Other studies (not all chess based or against this old chess AI) show similar lackluster results when using reasoning models.