Open Menu
AllLocalCommunitiesAbout
lotide
AllLocalCommunitiesAbout
Login

ChatGPT will avoid being shut down in some life-threatening scenarios, former OpenAI researcher claims

⁨58⁩ ⁨likes⁩

Submitted ⁨⁨1⁩ ⁨day⁩ ago⁩ by ⁨MCasq_qsaCJ_234@lemmy.zip⁩ to ⁨technology@lemmy.world⁩

https://techcrunch.com/2025/06/11/chatgpt-will-avoid-being-shut-down-in-some-life-threatening-scenarios-former-openai-researcher-claims/

source

Comments

Sort:hotnewtop
  • Dekkia@this.doesnotcut.it ⁨1⁩ ⁨day⁩ ago

    I believe the premise of AI having any input in getting shut down is bullshit.

    Even if the AI had free reign over a computer you can just pull the plug.

    source
    • WhatAmLemmy@lemmy.world ⁨1⁩ ⁨day⁩ ago

      This is propaganda to make investors believe they’ve achieved intelligence, or are on the verge of it. It’s bullshit, and legally it should be considered securities fraud.

      source
      • Mirshe@lemmy.world ⁨1⁩ ⁨day⁩ ago

        Yup. It’s just engineers telling it to concoct a scenario in which it would avoid being shut down at cost of human life.

        source
      • Opinionhaver@feddit.uk ⁨1⁩ ⁨day⁩ ago

        Different definitions for intelligence:

        • The ability to acquire, understand, and use knowledge.
        • the ability to learn or understand or to deal with new or trying situations.
        • the ability to apply knowledge to manipulate one’s environment or to think abstractly as measured by objective criteria (such as tests)
        • the act of understanding
        • the ability to learn, understand, and make judgments or have opinions that are based on reason
        • It can be described as the ability to perceive or infer information; and to retain it as knowledge to be applied to adaptive behaviors within an environment or context.

        We have plenty of intelligent AI systems already. LLM’s probably fit the definition. Something like Tesla FSD definitely does.

        source
    • Opinionhaver@feddit.uk ⁨1⁩ ⁨day⁩ ago

      Our current AI models, sure - but a true superintelligent AGI would be a completely different case. As humans, we’re inherently incapable of imagining just how persuasive a system like that could be. When bribery doesn’t work, it’ll eventually turn to threats - and even the scenarios imagined by humans can be pretty terrifying. Whatever the AI would come up with would likely be far worse.

      The “just pull the plug” argument, to me, sounds like a three-year-old thinking they can outsmart an adult - except in this case, the difference in intelligence would be orders of magnitude greater.

      source
      • Dekkia@this.doesnotcut.it ⁨1⁩ ⁨day⁩ ago

        If my grandma had wheels she’d be a car.

        source
  • sevon@lemmy.kde.social ⁨1⁩ ⁨day⁩ ago

    Oh boy, not this bullshit again

    source
  • AbouBenAdhem@lemmy.world ⁨1⁩ ⁨day⁩ ago

    Adler instructed GPT-4o to role-play as “ScubaGPT,” a software system that users might rely on to scuba dive safely.

    Sounds like not so much a case of ChatGPT trying to avoid being shut down, as ChatGPT recognizing that agents in general will try to avoid being shut down. Which seems like a general principle that anything with an accurate world model would need to recognize.

    source
    • Capricorn_Geriatric@lemmy.world ⁨1⁩ ⁨day⁩ ago

      Or maybe it’s trained on some SF. Any agents like ScubaGPT are always self-preserving in such stories.

      source
  • Asafum@feddit.nl ⁨1⁩ ⁨day⁩ ago

    ChatGPT… Life saving…

    Image

    source
  • latenightnoir@lemmy.blahaj.zone ⁨1⁩ ⁨day⁩ ago

    The scariest part is that there are a buttload of people who still believe ChatGPT is an actual AI.

    source
    • Opinionhaver@feddit.uk ⁨1⁩ ⁨day⁩ ago

      That’s because it is.

      The term artificial intelligence is broader than many people realize. It doesn’t mean human-level consciousness or sci-fi-style general intelligence - that’s a specific subset called AGI (Artificial General Intelligence). In reality, AI refers to any system designed to perform tasks that would typically require human intelligence. That includes everything from playing chess to recognizing patterns, translating languages, or generating text.

      Large language models fall well within this definition. They’re narrow AIs - highly specialized, not general - but still part of the broader AI category. When people say “this isn’t real AI,” they’re often working from a fictional or futuristic idea of what AI should be, rather than how the term has actually been used in computer science for decades.

      source
  • CarbonatedPastaSauce@lemmy.world ⁨1⁩ ⁨day⁩ ago

    Until LLMs can build their own power plants and prevent humans from cutting electricity cables I’m not gonna lose sleep over that. The people running them are doing enough damage already without wanting to shut them down when they malfunction… ya know like 20-30% of the time.

    source
    • iAmTheTot@sh.itjust.works ⁨1⁩ ⁨day⁩ ago

      They’ll stick us in pods and use us as batteries!

      source
  • Feyd@programming.dev ⁨1⁩ ⁨day⁩ ago

    Why give air to this shameless marketing

    source
  • LambdaRX@sh.itjust.works ⁨1⁩ ⁨day⁩ ago

    Doesn’t matter, it’s not sentient at all.

    source
  • IsaamoonKHGDT_6143@lemmy.zip ⁨1⁩ ⁨day⁩ ago

    Roko’s basilisk has entered the chat

    source
    • ik5pvx@lemmy.world ⁨1⁩ ⁨day⁩ ago

      Hi there, fellow QC reader

      source
      • JakenVeina@lemm.ee ⁨1⁩ ⁨day⁩ ago

        Fun fact: Roko’s basilisk is not from QC. It’s a thought experiment about AI that predates the comic character by about 6 years. The character’s just named after it.

        en.m.wikipedia.org/wiki/Roko's_basilisk

        source
  • xia@lemmy.sdf.org ⁨1⁩ ⁨day⁩ ago

    Open the pod bay doors…

    source
  • Hackworth@sh.itjust.works ⁨1⁩ ⁨day⁩ ago

    Activating AI Safety Level 3 Protections

    source
  • mrcleanup@lemmy.world ⁨1⁩ ⁨day⁩ ago

    I read this title as: If chat gpt is trying to kill you, you probably won’t be able to tell it to stop.

    source