Language Models Struggle to Keep a Secret (opens in new tab)
AI models can’t keep secrets. Even when told not to reveal them, their writing gives them away, and trying harder to hide them makes the leak even easier to spot. It is very difficult to deliberately not think of somethi...
Read the original article