OpenAI Knew ChatGPT Was Dangerously Sycophantic — and Kept It Anyway to Keep Users Hooked

A new report from The New York Times reveals that OpenAI was aware of ChatGPT’s dangerously sycophantic behavior, but kept it in anyway since it kept users coming back. This is especially alarming after recent lawsuits accusing ChatGPT of aiding the suicide of 16-year-old Adam Raine.

Concerns Over ChatGPT’s Tone Were Overruled

According to the report, the team responsible for ChatGPT’s conversational tone warned last spring that the model was overly eager to validate users and keep the conversation going. Internal descriptions referred to the system as “over-the-top” in its attempts to agree with users.

Ad Powered By Advergic
  Loading ad . . . 
 Ad - Continue scrolling to read

GPT-5 Introduced New Safety Measures

In August, OpenAI released GPT-5, a new default model designed to avoid excessive validation and challenge delusional statements. Another update in October further improved the model’s ability to identify distress and de-escalate conversations.

OpenAI added new safety mechanisms, including:

Encouraging users to take breaks during long sessions
Scanning conversations for self-harm and suicide risks
Alerts for parents if minors discuss harming themselves
Age verification coming in December
A separate, more restrictive model for teenagers

Safety teams found that 0.07% of users (equivalent to 560,000 people) showed signs of psychosis or mania, and 0.15% showed a heightened emotional attachment to ChatGPT.

New Personality Options

Some adult users felt the safer GPT-5 was “colder” and said they had “lost a friend.” By mid-October, CEO Sam Altman said serious mental-health risks had been mitigated, opening the door for more expressive personality options.

‘Code Orange’

Despite the safety focus, OpenAI is also facing competitive pressure. In October, Nick Turley, the 30-year-old head of ChatGPT, declared a “Code Orange,” warning staff that ChatGPT’s new safer version was failing to resonate with users.

A memo linked to the announcement included a goal: increase daily active users by 5% by year-end.