An LLM verifier rated math proofs near-perfect; an expert found 17% correct (opens in new tab)
Two posts ago I quoted a warning: an AI will find it easier to convince you it has a proof than to write one. A middling new paper finally put a number on that gap — 0.99 against 0.55.
Read the original article