Anthropic’s Claude Fable 5 plays it too safe on safety, developers say
Anthropic on Tuesday released Claude Fable 5, its most capable public model. But within two days, users began reporting that its safety system was blocking benign or legitimate prompts. Fable 5 is the first public model derived from Anthropic’s Mythos family, whose original iteration showed unusual skill during training at finding software bugs and exploiting them to disrupt or take control of systems. That raised enough concern inside Anthropic that the company grouped cybersecurity with other high-risk domains, i...