Advancing Mathematics Research with AI-Driven Formal Proof Search

Covered by 10 sources, including ScienceAlert and DEV Community. Discussed on Hacker News.

Large language models (LLMs) increasingly excel at mathematical reasoning, but their unreliability limits their utility in mathematics research. A mitigation is using LLMs to generate formal proofs in languages like Lean. We perform the first large-scale evaluation of this method's ability to solve open problems. Our most capable agent autonomously resolved 9 of 353 open Erdős problems at the per-problem cost of a few hundred dollars, proved...
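For readers unfamiliar with Lean, the snippet below is a minimal sketch of what a machine-checked proof looks like. It is a toy statement chosen for illustration only: the theorem name `add_comm_example` is hypothetical, and nothing here is taken from the paper or its agent. It assumes Lean 4 with only the core library.

```lean
-- Toy example, unrelated to the Erdős problems mentioned above.
-- The name `add_comm_example` is made up for this illustration.
theorem add_comm_example (a b : Nat) : a + b = b + a := by
  -- `Nat.add_comm` is a lemma from Lean's core library.
  exact Nat.add_comm a b
```

Lean's kernel accepts the file only if every step type-checks, so a proof that compiles cannot rest on a hallucinated argument. That check is what makes formal proof generation a plausible mitigation for unreliable LLM reasoning.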

Covered in 10 articles

ScienceAlert · Stunning AI Solution For 80-Year-Old Problem Shocks Mathematicians

DEV Community · How DeepMind AlphaProof Nexus Cracks 56-Year-Old Math: Agentic LLM Loops and Lean Formal Verification

The Conversation · An AI solution to an 80-year-old problem has shocked mathematicians
