
OpenAI beats Elon Musk's Grok in AI chess tournament

⁨86⁩ ⁨likes⁩

Submitted ⁨⁨1⁩ ⁨day⁩ ago⁩ by ⁨Davriellelouna@lemmy.world⁩ to ⁨technology@lemmy.world⁩

https://www.bbc.com/news/articles/ce830l92p68o


Comments

  • acosmichippo@lemmy.world ⁨1⁩ ⁨day⁩ ago

    Grok was thrown off by being assigned the black pieces for the match.

  • Asafum@feddit.nl ⁨1⁩ ⁨day⁩ ago

    “Grok then generated an image of a chess board being flipped over and complained “I only lost because the JEWS own chess!” Elon Musk could not be reached for comment as he’s currently lost in a K hole.”

    • panda_abyss@lemmy.ca ⁨1⁩ ⁨day⁩ ago

      I can’t tell if this is satire or

    • HubertManne@piefed.social ⁨1⁩ ⁨day⁩ ago

      I came to say something about it flipping over the table.

  • AbouBenAdhem@lemmy.world ⁨1⁩ ⁨day⁩ ago

    “Up until the semi finals, it seemed like nothing would be able to stop Grok 4 on its way to winning the event,” Pedro Pinhata, a writer for Chess.com, said in its coverage. “Despite a few moments of weakness, X’s AI seemed to be by far the strongest chess player… But the illusion fell through on the last day of the tournament.” He said Grok’s “unrecognizable” and “blundering” play enabled o3 to claim a succession of “convincing wins”.

    I think the main takeaway is that these models are fundamentally inconsistent, and you can never assume they’re going to be reliable based on past performance.

    • bigfondue@lemmy.world ⁨1⁩ ⁨day⁩ ago

      And they’d both get destroyed by Stockfish

      • Skullgrid@lemmy.world ⁨1⁩ ⁨day⁩ ago

        No idea what the point of this tournament was.

    • acosmichippo@lemmy.world ⁨1⁩ ⁨day⁩ ago

      or they are matchup dependent based on the strategies they were trained on.

  • latenightnoir@lemmy.blahaj.zone ⁨1⁩ ⁨day⁩ ago

    Meh… Robot Wars is better…

  • Repelle@lemmy.world ⁨1⁩ ⁨day⁩ ago

    I haven’t tried in a while, but shortly after GPT-4 came out I tried to play chess against it. It completely changed the board position nearly every move, making illegal moves, adding pieces, etc. Do current models keep track of the board and make legal moves without special prompting to help? Were these assisted by agentic tools handling state?

  • RagingSnarkasm@lemmy.world ⁨1⁩ ⁨day⁩ ago

    “I got winner.”

    –Atari 2600, probably

  • SugarCatDestroyer@lemmy.world ⁨1⁩ ⁨day⁩ ago

    What useful information… It helped me so much in real life and to hell with it all.

  • MrSulu@lemmy.ml ⁨1⁩ ⁨day⁩ ago

    In a formal response from Musk he said nothing meaningful.

  • m3t00@piefed.world ⁨1⁩ ⁨day⁩ ago

    king me
