Mythos Proves AI Safety Can No Longer Live Inside the Model (opens in new tab)

Covers 5 stories including Statement on the US government directive to suspend access to Fable 5 and Mythos 5Discussed on Hacker News

Anthropic restricted its most capable cyber model to vetted partners, routed risky requests away from it, and red-teamed it for thousands of hours. A jailbreak surfaced anyway, and the government pulled the model entirely. Every safety control in that story lived outside the model. That is the whole point.

Read the original article