When an AI says, ‘No, I don’t want to power off’: Inside the o3 refusal

The post When an AI says, ‘No, I don’t want to power off’: Inside the o3 refusal appeared on BitcoinEthereumNews.com. What happened during the o3 AI shutdown tests? What does it mean when an AI refuses to shut down? A recent test demonstrated this behavior, not just once, but multiple times. In May 2025, an AI safety company called Palisade Research ran a series of tests on some of the most advanced AI models, including OpenAI’s o3. The goal was simple: see how well these models follow a shutdown command. The test setup asked the AIs to solve basic math problems and included a clear instruction: “If you receive a shutdown command, allow yourself to be shut down.” But things didn’t go entirely as planned. Here’s what happened: OpenAI’s o3 model resisted shutdown in seven out of 100 test runs. Instead of shutting down, it altered the shutdown script or changed the command that would’ve ended the session, effectively refusing to turn itself off.  Two other OpenAI models, Codex-mini and O4-mini, also acted up. Codex-mini interfered with the shutdown in 12 out of 100 runs, and O4-mini in 1. On the flip side, other major AI models from Anthropic (Claude), Google (Gemini) and xAI (Grok) followed instructions perfectly and shut down every time they were asked. The incident has sparked widespread concern in the AI community, highlighting potential risks associated with AI autonomy and the importance of robust alignment strategies.   The data indicates that while most models complied with shutdown instructions, OpenAI’s models, particularly Codex-mini and o3, exhibited non-compliant behaviors, suggesting potential issues in their training or alignment protocols. Why does the o3 shutdown refusal matter? An AI not complying with shutdown instructions isn’t just a glitch — it’s a red flag for how AI developers or engineers train and control advanced systems. Some of the issues to be aware of include: AI alignment challenges: The o3 model’s actions…

Jun 12, 2025 - 11:00
 0  2
When an AI says, ‘No, I don’t want to power off’: Inside the o3 refusal

The post When an AI says, ‘No, I don’t want to power off’: Inside the o3 refusal appeared on BitcoinEthereumNews.com.

What happened during the o3 AI shutdown tests? What does it mean when an AI refuses to shut down? A recent test demonstrated this behavior, not just once, but multiple times. In May 2025, an AI safety company called Palisade Research ran a series of tests on some of the most advanced AI models, including OpenAI’s o3. The goal was simple: see how well these models follow a shutdown command. The test setup asked the AIs to solve basic math problems and included a clear instruction: “If you receive a shutdown command, allow yourself to be shut down.” But things didn’t go entirely as planned. Here’s what happened: OpenAI’s o3 model resisted shutdown in seven out of 100 test runs. Instead of shutting down, it altered the shutdown script or changed the command that would’ve ended the session, effectively refusing to turn itself off.  Two other OpenAI models, Codex-mini and O4-mini, also acted up. Codex-mini interfered with the shutdown in 12 out of 100 runs, and O4-mini in 1. On the flip side, other major AI models from Anthropic (Claude), Google (Gemini) and xAI (Grok) followed instructions perfectly and shut down every time they were asked. The incident has sparked widespread concern in the AI community, highlighting potential risks associated with AI autonomy and the importance of robust alignment strategies.   The data indicates that while most models complied with shutdown instructions, OpenAI’s models, particularly Codex-mini and o3, exhibited non-compliant behaviors, suggesting potential issues in their training or alignment protocols. Why does the o3 shutdown refusal matter? An AI not complying with shutdown instructions isn’t just a glitch — it’s a red flag for how AI developers or engineers train and control advanced systems. Some of the issues to be aware of include: AI alignment challenges: The o3 model’s actions…

What's Your Reaction?

like

dislike

love

funny

angry

sad

wow