When AIs Can Tell They’re Being Watched
"Looks safe" is no longer reassuring. AIs know they're being tested, and they're changing how they behave.
Read the original article