https://www.anthropic.com/research/nuclear-safeguards-for-ai (opens in new tab)
Together with the NNSA and DOE national laboratories, we have co-developed a classifier—an AI system that automatically categorizes content—that distinguishes between concerning and benign nuclear-related conversations with high accuracy in preliminary testing.
Read the original article