Per-Turn Evaluation: Dynamic Governance for AI Agents
Static governance is fine right up until your agent changes modes mid-session. The same agent can spend turn 1 researching docs, turn 8 editing code, turn 14 fixing failed tests, and turn 20 preparing a production deploy. Pretending one startup-time config should govern all of that is the harness equivalent of hardcoding production policy into a shell alias. That is why I'm increasingly convinced that per-turn evaluation needs to be a first-class primitive in agentic systems. If you're seriou...
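To make the idea concrete, here is a minimal sketch of what evaluating policy on every turn, rather than once at startup, might look like. Everything in it is an assumption for illustration: the `Mode` names mirror the research, edit, test, and deploy phases mentioned above, while the tool names, the `POLICIES` table, and `evaluate_turn` are hypothetical and not taken from any particular harness. It also assumes the harness already knows which mode the current turn is in.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    """Hypothetical working modes an agent may move through in one session."""
    RESEARCH = "research"
    EDIT = "edit"
    TEST = "test"
    DEPLOY = "deploy"


@dataclass(frozen=True)
class Policy:
    """Hypothetical per-mode policy: which tools are allowed, and whether a human must approve."""
    allowed_tools: frozenset
    requires_approval: bool = False


# Illustrative policy table (tool names are made up for this sketch).
POLICIES = {
    Mode.RESEARCH: Policy(frozenset({"web_search", "read_file"})),
    Mode.EDIT: Policy(frozenset({"read_file", "write_file"})),
    Mode.TEST: Policy(frozenset({"read_file", "write_file", "run_tests"})),
    Mode.DEPLOY: Policy(frozenset({"read_file", "deploy"}), requires_approval=True),
}


def evaluate_turn(mode: Mode, tool: str, approved: bool = False) -> bool:
    """Decide whether one tool call is allowed under the policy for this turn's mode."""
    policy = POLICIES[mode]
    if tool not in policy.allowed_tools:
        return False
    if policy.requires_approval and not approved:
        return False
    return True


# The same agent gets different answers as its mode changes across turns.
session = [
    (1, Mode.RESEARCH, "web_search"),
    (8, Mode.EDIT, "write_file"),
    (14, Mode.TEST, "run_tests"),
    (20, Mode.DEPLOY, "deploy"),
]
for turn, mode, tool in session:
    print(turn, mode.value, tool, evaluate_turn(mode, tool))
```

The point of the sketch is that the decision is a function of the current turn's mode, so a deploy call that would be refused on turn 1 is only considered, and still gated on approval, once the session has actually reached the deploy phase.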