New Anthropic research: Teaching Claude why. (opens in new tab)
New Anthropic research: Teaching Claude why. Last year we reported that, under certain experimental conditions, Claude 4 would blackmail users. Since then, we’ve completely eliminated this behavior. How?
Read the original article