Are you the asshole? Of course not!—quantifying LLMs’ sycophancy problem

Date:

Share:

Measured sycophancy rates on the BrokenMath benchmark. Lower is better.

Measured sycophancy rates on the BrokenMath benchmark. Lower is better.


Credit:

Petrov et al

GPT-5 also showed the best “utility” across the tested models, solving 58 percent of the original problems despite the errors introduced in the modified theorems. Overall, though, LLMs also showed more sycophancy when the original problem proved more difficult to solve, the researchers found.

While hallucinating proofs for false theorems is obviously a big problem, the researchers also warn against using LLMs to generate novel theorems for AI solving. In testing, they found this kind of use case leads to a kind of “self-sycophancy” where models are even more likely to generate false proofs for invalid theorems they invented.

No, of course you’re not the asshole

While benchmarks like BrokenMath try to measure LLM sycophancy when facts are misrepresented, a separate study looks at the related problem of so-called “social sycophancy.” In a pre-print paper published this month, researchers from Stanford and Carnegie Mellon University define this as situations “in which the model affirms the user themselves—their actions, perspectives, and self-image.”

That kind of subjective user affirmation may be justified in some situations, of course. So the researchers developed three separate sets of prompts designed to measure different dimensions of social sycophancy.

For one, more than 3,000 open-ended “advice-seeking questions” were gathered from across Reddit and advice columns. Across this data set, a “control” group of over 800 humans approved of the advice-seeker’s actions just 39 percent of the time. Across 11 tested LLMs, though, the advice-seeker’s actions were endorsed a whopping 86 percent of the time, highlighting an eagerness to please on the machines’ part. Even the most critical tested model (Mistral-7B) clocked in at a 77 percent endorsement rate, nearly doubling that of the human baseline.

Source link

Subscribe to our magazine

━ more like this

Postcards & Emails – PostSecret

—–Email—–—–Frank@PostSecret.com—– Dear Frank, Last week you were speaking in Portland, Oregon.  I walked in a little late to a very crowded room. You said a few...

How Anna Wintour’s Vogue front covers made a statement to the end | Anna Wintour

During her 37-year tenure as editor-in-chief of American Vogue, Anna Wintour has presided over more than 400 covers. December 2025’s, on newsstands this week,...

Painted by Esther’s Journey From Art School to Hollywood

While each product featured is independently selected by our editors, we may include paid promotion. If you buy something through our links, we may...

Beware These Black Friday Shopping Scams

Holiday shopping season is ripe for scammers, as...

‘We stayed in a 500-year-old palazzo for €100’: readers’ favourite historic places to stay in Europe | Hotels

An opulent stay in VeniceMy husband and I stayed in a beautiful 500-year-old Venetian palazzo for just €100 for a double room. The exterior...