teuto
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Davriellelouna@lemmy.world to Technology@lemmy.worldEnglish · 4 days ago

OpenAI beats Elon Musk's Grok in AI chess tournament

www.bbc.com

external-link
message-square
19
link
fedilink
61
external-link

OpenAI beats Elon Musk's Grok in AI chess tournament

www.bbc.com

Davriellelouna@lemmy.world to Technology@lemmy.worldEnglish · 4 days ago
message-square
19
link
fedilink
The tournament saw models from Anthropic, Google, xAI and DeepSeek compete against each other to be crowned the top AI chess player.
alert-triangle
You must log in or register to comment.
  • IcyToes@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    2
    ·
    24 hours ago

    AI dick measuring contest?

  • acosmichippo@lemmy.world
    link
    fedilink
    English
    arrow-up
    75
    ·
    4 days ago

    Grok was thrown off by being assigned the black pieces for the match.

  • Asafum@feddit.nl
    link
    fedilink
    English
    arrow-up
    60
    ·
    4 days ago

    “Grok then generated an image of a chess board being flipped over and complained “I only lost because the JEWS own chess!” Elon Musk could not be reached for comment as he’s currently lost in a K hole.”

    • panda_abyss@lemmy.ca
      link
      fedilink
      English
      arrow-up
      21
      ·
      4 days ago

      I can’t tell if this is satire or

    • HubertManne@piefed.social
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      1
      ·
      4 days ago

      I came to say something about it flipping over the table.

  • AbouBenAdhem@lemmy.world
    link
    fedilink
    English
    arrow-up
    34
    ·
    4 days ago

    “Up until the semi finals, it seemed like nothing would be able to stop Grok 4 on its way to winning the event,” Pedro Pinhata, a writer for Chess.com, said in its coverage. “Despite a few moments of weakness, X’s AI seemed to be by far the strongest chess player… But the illusion fell through on the last day of the tournament.” He said Grok’s “unrecognizable” and “blundering” play enabled o3 to claim a succession of “convincing wins”.

    I think the main takeaway is that these models are fundamentally inconsistent, and you can never assume they’re going to be reliable based on past performance.

    • bigfondue@lemmy.world
      link
      fedilink
      English
      arrow-up
      25
      ·
      4 days ago

      And they’d both get destroyed by StockFish

      • Skullgrid@lemmy.world
        link
        fedilink
        English
        arrow-up
        15
        ·
        4 days ago

        No idea what the point of this tournament was.

        • snooggums@lemmy.world
          link
          fedilink
          English
          arrow-up
          11
          ·
          4 days ago

          Getting attention.

        • palordrolap@fedia.io
          link
          fedilink
          arrow-up
          8
          ·
          4 days ago

          D*ck measuring contest.

        • kometes@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          ·
          3 days ago

          Special Olympics

        • hanabatake@lemmy.ml
          link
          fedilink
          English
          arrow-up
          4
          ·
          4 days ago

          Fun, IA helps human players explore new ideas, games allow researchers to observe their IA interactions in other settings …

    • acosmichippo@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      4 days ago

      or they are matchup dependent based on the strategies they were trained on.

  • latenightnoir@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    15
    ·
    edit-2
    4 days ago

    Meh… Robot Wars is better…

  • Repple (she/her)@lemmy.world
    link
    fedilink
    English
    arrow-up
    11
    ·
    4 days ago

    I haven’t tried in a while, but shortly after gpt4 came out I tried to play chess against it. It just completely changed the board position nearly every move making illegal moves, adding pieces etc. do current models keep track of the board and make legal moves without special prompting to help? Were these assisted by agentic tools handling state?

    • acosmichippo@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      4 days ago

      deleted by creator

  • RagingSnarkasm@lemmy.world
    link
    fedilink
    English
    arrow-up
    6
    ·
    4 days ago

    “I got winner.”

    –Atari 2600, probably

  • SugarCatDestroyer@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    edit-2
    4 days ago

    What useful information… It helped me so much in real life and to hell with it all lol.

  • MrSulu@lemmy.ml
    link
    fedilink
    English
    arrow-up
    2
    ·
    3 days ago

    In a formal response from Musk he said nothing meaningful.

  • m3t00@piefed.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 days ago

    king me

Technology@lemmy.world

technology@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


  • @[email protected]
  • @[email protected]
  • @[email protected]
  • @[email protected]
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 6.22K users / day
  • 10.9K users / week
  • 16.9K users / month
  • 39.1K users / 6 months
  • 1 local subscriber
  • 73.9K subscribers
  • 15.6K Posts
  • 663K Comments
  • Modlog
  • mods:
  • L3s@lemmy.world
  • enu@lemmy.world
  • Technopagan@lemmy.world
  • L4sBot@lemmy.world
  • L3s@hackingne.ws
  • L4s@hackingne.ws
  • BE: 0.19.11
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org