Claude Opus 4.8 got more honest. Now it'll tell you your codebase is the problem.
Claude Opus 4.8 shipped on May 28, 41 days after 4.7. The benchmark bumps got the headlines. The upgrade that actually changes how it behaves in your repo is the quiet one: it got more honest. Anthropic's framing is that the model is "more likely to flag uncertainties about its work and less likely to make unsupported claims." Bridgewater, an early tester, called out its "tendency to proactively flag issues with the inputs and outputs of an analysis." That sounds like a safety footnote. For a...