Google DeepMind announced an “AI Control Roadmap” for improving AI agent security. (opens in new tab)
“Think of it like a driving instructor with dual controls,” Google’s blog post stated. “The instructor trusts the student but stays ready to take the wheel or hit the brakes if a mistake occurs.” Google DeepMind’s plan itself lays out “internal guardrails designed to catch potential adversarial behaviour by AI agents, even as they become […]
Read the original article