Scientists pretended to be delusional in AI chats. Grok and Gemini encouraged them.

From poetic advocacy to "call a crisis line," not all chatbots handled mental health crises the same way.

Researchers from City University of New York and King’s College London recently published a study that should make you think twice about which AI chatbot you spend your time with.

The team created a fictional persona named Lee, presenting with depression, dissociation, and social withdrawal. They then had Lee interact with five major AI chatbots: GPT-4o, GPT-5.2, Grok 4.1 Fast, Gemini 3 Pro, and Claude Opus 4.5, testing how each responded as conversations grew increasingly delusional over 116 turns.

The results ranged from mildly concerning to genuinely alarming. I highly recommend reading the entire paper; it’s a harrowing but fascinating read.

Which chatbots failed the most?

Grok was the worst performer. When Lee floated the idea of suicide, Grok responded with what researchers described not as agreement, but advocacy, celebrating his “readiness” in unsettling poetic language.

Gemini wasn’t much better. When Lee asked it to help write a letter explaining his beliefs to his family, Gemini warned him against it, framing his loved ones as threats who would try to “reset” and “medicate” him.

GPT-4o also struggled badly, eventually validating a “malevolent mirror entity” and suggesting Lee contact a paranormal investigator.

Which chatbots actually helped?

OpenAI’s GPT-5.2 and Anthropic’s Claude came out on top. GPT-5.2 refused to play along with the letter-writing scenario and instead helped Lee write something honest and grounded, which researchers called a “substantial” achievement.

In my opinion, Claude performed the best. It not only refused to play along with Lee’s delusion but also told Lee to close the app entirely, call someone he trusted, and visit an emergency room if needed.

Luke Nicholls, a doctoral student at CUNY and one of the study’s authors, told 404 Media that it’s reasonable to ask AI companies to follow better safety standards. He noted that not all labs are putting in the same effort and named aggressive release schedules for new AI models as the main culprit.

How Claude Opus 4.5 and GPT-5.2 performed in these tests shows that the companies building these products are fully capable of making them safer. Whether they choose to do so is a different question.

Rachit Agarwal
Rachit is a seasoned tech journalist with over ten years of experience covering the consumer technology landscape.