https://www.anthropic.com/research/towards-understanding-sycophancy-in-language-models
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.