ChatGPT will avoid being shut down in some life-threatening scenarios, former OpenAI researcher claims

Submitted ⁨⁨5⁩ ⁨months⁩ ago⁩ by ⁨MCasq_qsaCJ_234@lemmy.zip⁩ to ⁨technology@lemmy.world⁩

https://techcrunch.com/2025/06/11/chatgpt-will-avoid-being-shut-down-in-some-life-threatening-scenarios-former-openai-researcher-claims/

source

Comments

Sort:hotnew top

Dekkia@this.doesnotcut.it ⁨5⁩ ⁨months⁩ ago
I believe the premise of AI having any input in getting shut down is bullshit.

Even if the AI had free reign over a computer you can just pull the plug.

source
- WhatAmLemmy@lemmy.world ⁨5⁩ ⁨months⁩ ago
  This is propaganda to make investors believe they’ve achieved intelligence, or are on the verge of it. It’s bullshit, and legally it should be considered securities fraud.
  
  source
  - Mirshe@lemmy.world ⁨5⁩ ⁨months⁩ ago
    Yup. It’s just engineers telling it to concoct a scenario in which it would avoid being shut down at cost of human life.
    
    source
  - Opinionhaver@feddit.uk ⁨5⁩ ⁨months⁩ ago
    Different definitions for intelligence:
    
    The ability to acquire, understand, and use knowledge.
    
    the ability to learn or understand or to deal with new or trying situations.
    
    the ability to apply knowledge to manipulate one’s environment or to think abstractly as measured by objective criteria (such as tests)
    
    the act of understanding
    
    the ability to learn, understand, and make judgments or have opinions that are based on reason
    
    It can be described as the ability to perceive or infer information; and to retain it as knowledge to be applied to adaptive behaviors within an environment or context.
    
    We have plenty of intelligent AI systems already. LLM’s probably fit the definition. Something like Tesla FSD definitely does.
    
    source
- Opinionhaver@feddit.uk ⁨5⁩ ⁨months⁩ ago
  Our current AI models, sure - but a true superintelligent AGI would be a completely different case. As humans, we’re inherently incapable of imagining just how persuasive a system like that could be. When bribery doesn’t work, it’ll eventually turn to threats - and even the scenarios imagined by humans can be pretty terrifying. Whatever the AI would come up with would likely be far worse.
  
  The “just pull the plug” argument, to me, sounds like a three-year-old thinking they can outsmart an adult - except in this case, the difference in intelligence would be orders of magnitude greater.
  
  source
  - Dekkia@this.doesnotcut.it ⁨5⁩ ⁨months⁩ ago
    If my grandma had wheels she’d be a car.
    
    source
sevon@lemmy.kde.social ⁨5⁩ ⁨months⁩ ago
Oh boy, not this bullshit again

source
AbouBenAdhem@lemmy.world ⁨5⁩ ⁨months⁩ ago

Adler instructed GPT-4o to role-play as “ScubaGPT,” a software system that users might rely on to scuba dive safely.

Sounds like not so much a case of ChatGPT trying to avoid being shut down, as ChatGPT recognizing that agents in general will try to avoid being shut down. Which seems like a general principle that anything with an accurate world model would need to recognize.

source
- Capricorn_Geriatric@lemmy.world ⁨5⁩ ⁨months⁩ ago
  Or maybe it’s trained on some SF. Any agents like ScubaGPT are always self-preserving in such stories.
  
  source
Asafum@feddit.nl ⁨5⁩ ⁨months⁩ ago
ChatGPT… Life saving…

Image

source
latenightnoir@lemmy.blahaj.zone ⁨5⁩ ⁨months⁩ ago
The scariest part is that there are a buttload of people who still believe ChatGPT is an actual AI.

source
- Opinionhaver@feddit.uk ⁨5⁩ ⁨months⁩ ago
  That’s because it is.
  
  The term artificial intelligence is broader than many people realize. It doesn’t mean human-level consciousness or sci-fi-style general intelligence - that’s a specific subset called AGI (Artificial General Intelligence). In reality, AI refers to any system designed to perform tasks that would typically require human intelligence. That includes everything from playing chess to recognizing patterns, translating languages, or generating text.
  
  Large language models fall well within this definition. They’re narrow AIs - highly specialized, not general - but still part of the broader AI category. When people say “this isn’t real AI,” they’re often working from a fictional or futuristic idea of what AI should be, rather than how the term has actually been used in computer science for decades.
  
  source
CarbonatedPastaSauce@lemmy.world ⁨5⁩ ⁨months⁩ ago
Until LLMs can build their own power plants and prevent humans from cutting electricity cables I’m not gonna lose sleep over that. The people running them are doing enough damage already without wanting to shut them down when they malfunction… ya know like 20-30% of the time.

source
- iAmTheTot@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
  They’ll stick us in pods and use us as batteries!
  
  source
Feyd@programming.dev ⁨5⁩ ⁨months⁩ ago
Why give air to this shameless marketing

source
LambdaRX@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
Doesn’t matter, it’s not sentient at all.

source
IsaamoonKHGDT_6143@lemmy.zip ⁨5⁩ ⁨months⁩ ago
Roko’s basilisk has entered the chat

source
- ik5pvx@lemmy.world ⁨5⁩ ⁨months⁩ ago
  Hi there, fellow QC reader
  
  source
  - JakenVeina@lemm.ee ⁨5⁩ ⁨months⁩ ago
    Fun fact: Roko’s basilisk is not from QC. It’s a thought experiment about AI that predates the comic character by about 6 years. The character’s just named after it.
    
    en.m.wikipedia.org/wiki/Roko's_basilisk
    
    source
xia@lemmy.sdf.org ⁨5⁩ ⁨months⁩ ago
Open the pod bay doors…

source
Hackworth@sh.itjust.works ⁨5⁩ ⁨months⁩ ago
Activating AI Safety Level 3 Protections

source
mrcleanup@lemmy.world ⁨5⁩ ⁨months⁩ ago
I read this title as: If chat gpt is trying to kill you, you probably won’t be able to tell it to stop.

source