We Reproduced Anthropic's Mythos Findings With Public Models (opens in new tab)
submitted by codeinabox to security1 points | 0 comments Anthropic presents Mythos and Project Glasswing as evidence that advanced AI vulnerability research should be restricted. But our replication suggests a different conclusion: the capabilities Anthropic points to are already available in public models, so defenders should prepare for that reality instead.
Read the original article