Flipping the eval on its head (opens in new tab)

Covers 2 stories including Project Glasswing: Securing critical software for the AI era

An eval is a product. Typically, its 1 x n or k x n where there are n samples and 1 or k different language models. This briefing will argue that we’d like to see k x n x m evals, or however many dimensions.This post is pitching an ambitious way to spend tokens on cyberhardening. If its not viable at current capabilities/costs, it may be viable next year or in six months.HardeningThere are broadly three approaches to cyberhardening with or uplifted formal methods.You can and then ask claude f...

Read the original article