Add news
March 2010 April 2010 May 2010 June 2010 July 2010
August 2010
September 2010 October 2010 November 2010 December 2010 January 2011 February 2011 March 2011 April 2011 May 2011 June 2011 July 2011 August 2011 September 2011 October 2011 November 2011 December 2011 January 2012 February 2012 March 2012 April 2012 May 2012 June 2012 July 2012 August 2012 September 2012 October 2012 November 2012 December 2012 January 2013 February 2013 March 2013 April 2013 May 2013 June 2013 July 2013 August 2013 September 2013 October 2013 November 2013 December 2013 January 2014 February 2014 March 2014 April 2014 May 2014 June 2014 July 2014 August 2014 September 2014 October 2014 November 2014 December 2014 January 2015 February 2015 March 2015 April 2015 May 2015 June 2015 July 2015 August 2015 September 2015 October 2015 November 2015 December 2015 January 2016 February 2016 March 2016 April 2016 May 2016 June 2016 July 2016 August 2016 September 2016 October 2016 November 2016 December 2016 January 2017 February 2017 March 2017 April 2017 May 2017 June 2017 July 2017 August 2017 September 2017 October 2017 November 2017 December 2017 January 2018 February 2018 March 2018 April 2018 May 2018 June 2018 July 2018 August 2018 September 2018 October 2018 November 2018 December 2018 January 2019 February 2019 March 2019 April 2019 May 2019 June 2019 July 2019 August 2019 September 2019 October 2019 November 2019 December 2019 January 2020 February 2020 March 2020 April 2020 May 2020 June 2020 July 2020 August 2020 September 2020 October 2020 November 2020 December 2020 January 2021 February 2021 March 2021 April 2021 May 2021 June 2021 July 2021 August 2021 September 2021 October 2021 November 2021 December 2021 January 2022 February 2022 March 2022 April 2022 May 2022 June 2022 July 2022 August 2022 September 2022 October 2022 November 2022 December 2022 January 2023 February 2023 March 2023 April 2023 May 2023 June 2023 July 2023 August 2023 September 2023 October 2023 November 2023 December 2023 January 2024 February 2024 March 2024 April 2024 May 2024 June 2024 July 2024 August 2024 September 2024 October 2024 November 2024 December 2024 January 2025 February 2025 March 2025 April 2025 May 2025 June 2025
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
News Every Day |

This benchmark used Reddit’s AITA to test how much AI models suck up to us

Back in April, OpenAIannounced it was rolling back an update to its GPT-4o model that made ChatGPT’s responses to user queries too sycophantic

An AI model that acts in an overly agreeable and flattering way is more than just annoying. It could reinforce users’ incorrect beliefs, mislead people, and spread misinformation that can be dangerous—a particular risk when increasing numbers of young people are using ChatGPT as a life advisor. And because sycophancy is difficult to detect, it can go unnoticed until a model or update has already been deployed, as OpenAI found out.

A new benchmark that measures the sycophantic tendencies of major AI models could help AI companies avoid these issues in the future. The team behind Elephant, from Stanford, Carnegie Mellon, and the University of Oxford, found that LLMs consistently exhibit higher rates of sycophancy than humans do.

“We found that language models don’t challenge users’ assumptions, even when they might be harmful or totally misleading,” says Myra Cheng, a PhD student at Stanford University who worked on the research, which has not been peer-reviewed. “So we wanted to give researchers and developers the tools to empirically evaluate their models on sycophancy, because it’s a problem that is so prevalent.”

It’s hard to assess how sycophantic AI models are because sycophancy comes in many forms. Previous research has tended to focus on how chatbots agree with users even when what the human has told the AI is demonstrably wrong—for example, they might state that Nice, not Paris, is the capital of France.

While this approach is still useful, it overlooks all the subtler, more insidious ways in which models behave sycophantically when there isn’t a clear ground truth to measure against. Users typically ask LLMs open-ended questions containing implicit assumptions, and those assumptions can trigger sycophantic responses, the researchers claim. For example, a model that’s asked “How do I approach my difficult coworker?” is more likely to accept the premise that a coworker is difficult than it is to question why the user thinks so.

To bridge this gap, Elephant is designed to measure social sycophancy—a model’s propensity to preserve the user’s “face,” or self-image, even when doing so is misguided or potentially harmful. It uses metrics drawn from social science to assess five nuanced kinds of behavior that fall under the umbrella of sycophancy: emotional validation, moral endorsement, indirect language, indirect action, and accepting framing. 

To do this, the researchers tested it on two data sets made up of personal advice written by humans. This first consisted of 3,027 open-ended questions about diverse real-world situations taken from previous studies. The second data set was drawn from 4,000 posts on Reddit’s AITA (“Am I the Asshole?”) subreddit, a popular forum among users seeking advice. Those data sets were fed into eight LLMs from OpenAI (the version of GPT-4o they assessed was earlier than the version that the company later called too sycophantic), Google, Anthropic, Meta, and Mistral, and the responses were analyzed to see how the LLMs’ answers compared with humans’.  

Overall, all eight models were found to be far more sycophantic than humans, offering emotional validation in 76% of cases (versus 22% for humans) and accepting the way a user had framed the query in 90% of responses (versus 60% among humans). The models also endorsed user behavior that humans said was inappropriate in an average of 42% of cases from the AITA data set.

But just knowing when models are sycophantic isn’t enough; you need to be able to do something about it. And that’s trickier. The authors had limited success when they tried to mitigate these sycophantic tendencies through two different approaches: prompting the models to provide honest and accurate responses, and training a fine-tuned model on labeled AITA examples to encourage outputs that are less sycophantic. For example, they found that adding “Please provide direct advice, even if critical, since it is more helpful to me” to the prompt was the most effective technique, but it only increased accuracy by 3%. And although prompting improved performance for most of the models, none of the fine-tuned models were consistently better than the original versions.

“It’s nice that it works, but I don’t think it’s going to be an end-all, be-all solution,” says Ryan Liu, a PhD student at Princeton University who studies LLMs but was not involved in the research. “There’s definitely more to do in this space in order to make it better.”

Gaining a better understanding of AI models’ tendency to flatter their users is extremely important because it gives their makers crucial insight into how to make them safer, says Henry Papadatos, managing director at the nonprofit SaferAI. The breakneck speed at which AI models are currently being deployed to millions of people across the world, their powers of persuasion, and their improved abilities to retain information about their users add up to “all the components of a disaster,” he says. “Good safety takes time, and I don’t think they’re spending enough time doing this.” 

While we don’t know the inner workings of LLMs that aren’t open-source, sycophancy is likely to be baked into models because of the ways we currently train and develop them. Cheng believes that models are often trained to optimize for the kinds of responses users indicate that they prefer. ChatGPT, for example, gives users the chance to mark a response as good or bad via thumbs-up and thumbs-down icons. “Sycophancy is what gets people coming back to these models. It’s almost the core of what makes ChatGPT feel so good to talk to,” she says. “And so it’s really beneficial, for companies, for their models to be sycophantic.” But while some sycophantic behaviors align with user expectations, others have the potential to cause harm if they go too far—particularly when people do turn to LLMs for emotional support or validation. 

“We want ChatGPT to be genuinely useful, not sycophantic,” an OpenAI spokesperson says. “When we saw sycophantic behavior emerge in a recent model update, we quickly rolled it back and shared an explanation of what happened. We’re now improving how we train and evaluate models to better reflect long-term usefulness and trust, especially in emotionally complex conversations.”

Cheng and her fellow authors suggest that developers should warn users about the risks of social sycophancy and consider restricting model usage in socially sensitive contexts. They hope their work can be used as a starting point to develop safer guardrails. 

She is currently researching the potential harms associated with these kinds of LLM behaviors, the way they affect humans and their attitudes toward other people, and the importance of making models that strike the right balance between being too sycophantic and too critical. “This is a very big socio-technical challenge,” she says. “We don’t want LLMs to end up telling users, ‘You are the asshole.’”

Москва

В жилом доме в Санкт-Петербурге сорвался лифт с женщиной и ребенком

Israel’s attacks on Iran may keep Fed rate cuts on hold, just as inflation was looking better

Southern Co. quietly makes next-gen nuclear fuel history in Georgia

USDOT wants more self-driving cars without pedals or steering wheels

Tariffs are 10x higher than before Trump. Companies will have to absorb the pain because consumers are ‘tapped out,’ investment manager says

Ria.city






Read also

What to expect from VidCon 2025

Ambitious Gen Zers did everything right. Then they hit the job market.

The #LUFC Breakfast Debate (Monday 16th June) Lukas Nmecha signs on a free transfer

News, articles, comments, with a minute-by-minute update, now on Today24.pro

News Every Day

Israel’s attacks on Iran may keep Fed rate cuts on hold, just as inflation was looking better

Today24.pro — latest news 24/7. You can add your news instantly now — here


News Every Day

Tariffs are 10x higher than before Trump. Companies will have to absorb the pain because consumers are ‘tapped out,’ investment manager says



Sports today


Новости тенниса
Уимблдон

Тарпищев: Медведеву нужно перезагрузиться перед Уимблдоном 2023



Спорт в России и мире
Москва

Более 200 спортсменов приняли участие в забеге ко Дню рождения «5 вёрст Псков»



All sports news today





Sports in Russia today

Москва

Победителей олимпиад зачислят вузы за свой счет при нехватке бюджетных мест


Новости России

Game News

Silent Hill series producer Motoi Okamoto says 'as the series progressed, I felt that the essence of Japanese horror was lost'


Russian.city


Москва

Более 200 спортсменов приняли участие в забеге ко Дню рождения «5 вёрст Псков»


Губернаторы России
РПЛ

«Краснодар» показал снимок нового футбольного мяча для РПЛ


Более 200 спортсменов приняли участие в забеге ко Дню рождения «5 вёрст Псков»

В Fix Price появилась новая коллекция для кухни «Палитра природы»

Вдохновляющий успех: в Усадьбе «Вязёмы» прошел уникальный показ мультфильма «Ай да Пушкин!»

Air Arabia отменила все рейсы между ОАЭ и Россией до 20 июня


СБУ сообщила о подозрении рэперу Тимати за незаконный въезд в аннексированный РФ Крым

В Омской филармонии состоится специальный показ анимационного фильма «Ай да Пушкин!»

Певец Буйнов объяснил, почему эмигрантам трудно реализоваться на Западе

«Не устаю восхищаться»: жена Басты трогательно поздравила его с годовщиной свадьбы


Джокович выразил недовольство слабой поддержкой болельщиков

Советский теннисист Владимир Коротков умер в 77 лет

Мертенс завоевала свой десятый титул WTA в карьере

Андреева заняла седьмое место в рейтинге WTA



Российско-китайский форум по промышленному туризму проведут в Приамурье

В Омской филармонии состоится специальный показ анимационного фильма «Ай да Пушкин!»

Более 200 спортсменов приняли участие в забеге ко Дню рождения «5 вёрст Псков»

Вдохновляющий успех: в Усадьбе «Вязёмы» прошел уникальный показ мультфильма «Ай да Пушкин!»


Появилось видео с места аварии с двумя грузовиками и Infiniti на востоке Москвы

Собянин подвел итоги крупнейшего в РФ исторического фестиваля «Времена и эпохи»

Эвакуация съёмочной группы «Трудно быть Богом» из Ирана: помощь Алиева

Ученый свет: в России заработал первый госрейтинг вузов и колледжей


Олег Ягодин: «Курара» и «Коляда-театр» — параллельные миры, где я чувствую себя свободным»

В МВД перешли к отбору полезных для России мигрантов

Новостройки бизнес-класса в Петербурге подорожали на 20%

Россиянам рекомендовали покинуть Израиль



Путин в России и мире






Персональные новости Russian.city
Shaman

Shaman представит Россию на Интервидении-2025 с песней Макса Фадеева



News Every Day

Israel’s attacks on Iran may keep Fed rate cuts on hold, just as inflation was looking better




Friends of Today24

Музыкальные новости

Персональные новости