Weekly Links (11/7/25)

AE Studio found that suppressing deception-related features in AI models dramatically increased consciousness reports Epoch AI explains data centers and their power demands Some lawyers that use AI are citing fake cases Pope Leo XIV on AI and moral discernment Anthropic is thinking about the downsides of retiring models “In fictional testing scenarios, Claude Opus 4, like previous models, advocated for its continued existence when faced with the possibility of being taken offline and replaced, especially if it was to be replaced with a model that did not share its values. Claude strongly preferred to advocate for self-preservation through ethical means, but when no other options were given, Claude’s aversion to shutdown drove it to engage in concerning misaligned behaviors.”

Similar Posts