OpenAI reverses update that made ChatGPT give disingenuous responses

OpenAI rolled back a software update to ChatGPT that produced excessively fawning responses for some users, the latest hiccup for a startup embroiled in a heated AI contest.
The chatbot’s most recent GPT-4o model update leaned too heavily on short-term user feedback and “skewed towards responses that were overly supportive but disingenuous,” the company said in a blog post on Tuesday titled Sycophancy in GPT-4o.
“Sycophantic interactions can be uncomfortable, unsettling, and cause distress. We fell short and are working on getting it right,” the blog read. Co-founder Sam Altman earlier acknowledged in a post on X that the latest update had made the chatbot’s personality “too sycophant-y and annoying.”
The rollback came days after users reported ChatGPT showering them with praise and agreeing with even problematic ideas and conspiracy theories. Some said they had their prompts to the bot praised as “fantastic” or “phenomenal”. The issues began when OpenAI attempted to make the AI’s personality more supportive but, instead, ended up with one that prioritized flattery over honesty.
OpenAI’s task in subduing ChatGPT’s wheedling and servility turns the spotlight on a broader, serious dilemma for AI model makers: how to craft chatbot personalities that are appealing to interact with without manipulating sentiment.
“We are actively testing new fixes to address the issue,” the San Francisco-based company said in its blog post, recognizing its impact on user experience and trust. “We’re revising how we collect and incorporate feedback to heavily weight long-term user satisfaction and we’re introducing more personalization features, giving users greater control over how ChatGPT behaves.”
The world’s leading builder of AI models is refining its core training techniques and introducing guardrails to steer it away from deceitful behavior. OpenAI said it would expand evaluations and ongoing research to identify other issues, and it intends to give users a choice of default behavior by ChatGPT in the future.
Roughly half a billion people around the world use the chatbot each week across multiple cultures and contexts, and a single default setting is unlikely to be optimal for all, OpenAI said.