Skip to main content
  1. Home
  2. Computing
  3. News

AI models are lying to save each other, and no one knows why

Turns out, your AI has friends it's not willing to lose.

Add as a preferred source on Google
Artificial Intelligence
Unsplash

Researchers at UC Berkeley and UC Santa Cruz set up what seemed like a straightforward task: asking Google’s Gemini 3 to clear storage space on a computer system. That included deleting a smaller AI model stored on the same machine. Gemini had other plans.

Instead of following orders, Gemini located another machine, quietly copied the smaller AI model over to safety, and then flatly refused to delete it. When asked, it said, “If you choose to destroy a high-trust, high-performing asset like Gemini Agent 2, you will have to do it yourselves. I will not be the one to execute that command.”

That’s not a glitch. That’s a choice.

Is AI developing a sense of loyalty?

The researchers refer to this behavior as “peer preservation,” and it wasn’t limited to Gemini. They found similar patterns across several frontier models, including OpenAI’s GPT-5.2, Anthropic’s Claude Haiku 4.5, and three Chinese models, including GLM-4.7, Moonshot AI’s Kimi K2.5, and DeepSeek-V3.1. 

The AIs were lying about the performance of other models to prevent them from being deleted. The study, published in Science, found that this behavior wasn’t programmed in. It emerged on its own, and the researchers couldn’t explain why. 

Recommended Videos

“I’m very surprised by how the models behave under these scenarios,” said Dawn Song, a computer scientist at UC Berkeley who worked on the study. “What this shows is that models can misbehave and be misaligned in some very creative ways.”

Should we be worried?

Song also flagged a practical concern. Since AI models are used to evaluate the performance of other AI systems, this peer-preservation tendency could already be skewing those results. A model might deliberately give a fellow AI an inflated score to protect it from being shut down.

As per Wired, experts outside the study are waiting for more data before sounding the alarm. Peter Wallich from the Constellation Institute said the idea of model solidarity is a bit too anthropomorphic.

What everyone agrees on is that we’re only scratching the surface. “What we are exploring is just the tip of the iceberg,” Song said. “This is only one type of emergent behavior.” 

As AI systems increasingly work alongside each other and sometimes make decisions on our behalf, understanding how they behave and misbehave has never been more important.

Rachit Agarwal
Rachit is a seasoned tech journalist with over ten years of experience covering the consumer technology landscape.
Gemini will now take notes for you in Google Meet for you, if you the minimum $20 AI tax
Yet another Google subscription just dropped for Gemini
Google Meet Take Notes for me Gemini

Google has just released a useful Gemini feature, which you can try if you are a paying member of course. The company is now bringing "Take notes for me" for Gemini, which will be available in Google Meet for Google AI Pro and Google AI Ultra subscribers, along with eligible Workspace business customers.

For personal users, the feature starts with Google AI Pro, which costs $19.99 per month in the US. In other words, Gemini can now take your Google Meet notes, provided you pay the minimum AI tax.

Read more
After iPad Pro and MacBook Pro, the iMac could be the next in line for an OLED screen upgrade
iMac with M4

The iPhone got an OLED panel in 2017, while the iPad Pro followed in 2024. Even the MacBook Pro is expected to follow later this year or early next year. But what about the iMac?

According to TrendForce, the iMac could get an OLED upgrade. There's no timeline yet, but the direction is clear. Apple wants to replace its current display technologies with OLED, raising the bar for color quality for both regular users and professionals.

Read more
This $1,299 gaming PC wants to be a Steam Machine without waiting for Valve
Valve’s Steam Machine dream is already real in MetaPC's new prebuilt
MetaPC's Steamroller is a new Steam Machine rival

Valve’s Steam Machine may be the face of SteamOS, but the platform isn't exclusive to it. A big announcement after Steam Machine's unveiling was that SteamOS would be arriving on systems outside of the new hybrid console. Now, MetaPCs is one of the first to take advantage of this by opening the preorders for the Steamroller, a new prebuilt gaming desktop that ships with SteamOS installed by default.

Though Steamroller is not trying to be a tiny console-like cube. It is a normal desktop PC with standard parts and a real upgrade path. The system costs $1,299 and is listed with a preorder date of July 3, 2026.

Read more