A Faster Librarian (opens in new tab)
The Librarian is now ~3x faster and 43% cheaper, with the same quality. It now runs on GPT-5.5 (no reasoning) with websocket mode and an updated system prompt that encourages more parallel exploration. The Librarian fires ~8 tool calls in parallel per turn, up from ~3 with Sonnet, and wraps up a search in ~5 turns instead of ~15. In our internal eval, about a quarter of that speedup comes from OpenAI's websocket mode and the rest from switching to GPT-5.5 with no reasoning: Sonnet-4.6 (medium...
Read the original article