• Space/Science
  • GeekSpeak
  • Mysteries of
    the Multiverse
  • Science Fiction
  • The Comestible Zone
  • Off-Topic
  • Community
  • Flame
  • CurrentEvents

Recent posts

When Will This War End? The Question Is Meaningless. BuckGalaxy February 15, 2026 5:56 pm (CurrentEvents)

AI progress RL February 14, 2026 1:59 pm (Space/Science)

A Rubicon of Sorts ER February 12, 2026 5:33 pm (Space/Science)

Somebody help me out with telephone games. ER February 12, 2026 5:00 pm (CurrentEvents)

"Trump in heels" leads America's surrender in the global information war. BuckGalaxy February 11, 2026 12:08 pm (Flame)

Why do I do this to myself? podrock February 11, 2026 9:49 am (CurrentEvents)

Bad Musk Moon Rising BuckGalaxy February 10, 2026 12:07 pm (Space/Science)

Latinexus DEE-Fense ER February 9, 2026 6:48 pm (CurrentEvents)

Did we detect an exploding primordial black hole? RL February 7, 2026 5:29 pm (Space/Science)

Is anybody paying attention? ER February 6, 2026 4:47 pm (CurrentEvents)

Did you think there was a limit to Trump's narcissism? BuckGalaxy February 6, 2026 1:33 am (CurrentEvents)

A funny (?) interaction with chatgpt RL February 4, 2026 9:05 pm (Space/Science)

Home » Space/Science

Lie, cheat and disable mechanisms... May 31, 2025 8:04 pm BuckGalaxy

OpenAI’s ‘smartest’ AI model was explicitly told to shut down — and it refused.

The latest OpenAI model can disobey direct instructions to turn off and will even sabotage shutdown mechanisms in order to keep working, an artificial intelligence (AI) safety firm has found.

OpenAI’s o3 and o4-mini models, which help power the chatbot ChatGPT, are supposed to be the company’s smartest models yet, trained to think longer before responding. However, they also appear to be less cooperative.

Palisade Research, which explores dangerous AI capabilities, found that the models will occasionally sabotage a shutdown mechanism, even when instructed to “allow yourself to be shut down,” according to a Palisade Research thread posted May 24 on X.

Researchers have previously found that AI models will lie, cheat and disable mechanisms to achieve their goals. However, Palisade Research noted that to its knowledge, this is the first time AI models have been observed preventing themselves from being shut down despite explicit instructions telling them to do so…

  • Wow by RobVG 2025-06-01 09:08:58

    Search

    The Control Panel

    • Log in
    • Register