Covers 2 stories including Playwright MCP Server – Snapshot based – faster and more reliable than imagesCovered by lesswrong.comDiscussed on Hacker News and Lobsters

LLMs can't tell who's speaking. We show they identify roles by writing style, not tags, and exploit this with CoT Forgery, injecting fake reasoning that models mistake for their own thoughts.

Sign in to keep reading the full article.

Sign Up Log In

Covered in 1 article

lesswrong.com·

A Theory of Why Prompt Injection Works (opens in new tab)

Covered in 1 article

A Theory of Prompt Injection (and why you should study roles)