Anthropic on Tuesday , its most capable public model. But within two days, users began reporting that its safety system was blocking benign or legitimate prompts. Fable 5 is the first public model derived from Anthropic’s Mythos family, whose original iteration showed unusual skill during training at finding software bugs and exploiting them to disrupt or take control of systems. That raised enough concern inside Anthropic that the company grouped cybersecurity with other high-risk domains, i...

Sign in to keep reading the full article.

Cited by 1 article

Anthropic revises Claude Fable 5 guardrails after criticism

kite.kagi.com·