Factlen ExplainerAI in MathematicsExplainerJun 15, 2026, 10:36 AM· 4 min read

AI Models Generate Novel Mathematical Proofs, Disproving Decades-Old Conjectures

Frontier artificial intelligence systems have transitioned from solving known equations to generating original research-level proofs. Recent breakthroughs by OpenAI and Google DeepMind have successfully resolved long-standing open problems in discrete geometry and combinatorics.

By Factlen Editorial Team

Share this story

AI Capabilities Advocates 40%Mathematical Purists 40%Editorial Synthesis 20%

AI Capabilities Advocates: Argue that AI is transitioning from a computational tool to a genuine scientific collaborator capable of original discovery.
Mathematical Purists: Emphasize the need for rigorous evaluation to distinguish between true mathematical creativity and sophisticated pattern matching.
Editorial Synthesis: Synthesizes the evidence to provide a balanced view of AI's current capabilities and future trajectory.

What's not represented

· Educators adapting math curricula to AI capabilities
· Philosophers of science studying the nature of machine creativity

Why this matters

For decades, advanced mathematics has been the exclusive domain of human intuition. AI's new ability to autonomously solve open research problems signals a fundamental shift in how scientific discovery will happen, accelerating breakthroughs in physics, cryptography, and engineering.

Key points

In early 2026, frontier AI models transitioned from solving known math problems to generating novel proofs for open research questions.
OpenAI's model disproved a decades-old conjecture in discrete geometry related to the Erdős planar unit distance problem.
Google DeepMind's Aletheia agent autonomously solved multiple open questions from the Erdős Conjectures database.
The AI achieved these breakthroughs by applying techniques from unexpected fields, such as algebraic number theory, to geometric problems.
Mathematicians are launching initiatives to rigorously evaluate whether AI is demonstrating true creativity or merely sophisticated computation.

1946

Year Erdős posed the problem

700

Open problems evaluated by AI

90%

AI score on PhD-level math

For decades, artificial intelligence has treated mathematics as a closed-book exam. Models were trained to solve known problems, competing on olympiads and standardized tests where the answers were already established. As recently as 2025, the benchmark for AI was achieving gold-medal performance at the International Mathematical Olympiad—a formidable feat, but one that relied on curated problems with fixed targets.[1][2]

But in the spring of 2026, the paradigm shifted from retrieval to genuine discovery. AI is no longer just passing tests; it is actively expanding the frontier of human knowledge by generating novel proofs for open research questions that have stumped mathematicians for decades. This transition marks the moment artificial intelligence moved from being a sophisticated calculator to a genuine scientific collaborator.[6]

Two major breakthroughs have demonstrated that frontier AI models are now capable of producing original, research-level mathematical proofs. In February, Google DeepMind unveiled Aletheia, an agent powered by Gemini Deep Think, which autonomously solved open questions from the Erdős Conjectures database.[2]

Then, in May, OpenAI announced that its general-purpose reasoning model had disproved a central conjecture in discrete geometry tied to Paul Erdős's planar unit distance problem—a famously stubborn question first posed in 1946. The Erdős planar unit distance problem asks for the maximum number of pairs of points in a plane that can be exactly one unit distance apart.[1]

The Erdős planar unit distance problem asks for the maximum number of pairs of points in a plane that can be exactly one unit distance apart.

For nearly eighty years, mathematicians had relied on grid-based, geometric intuition to approach the problem, treating certain constructions as basically optimal. OpenAI's model upended this assumption entirely. Instead of relying purely on geometric grids, the AI drew from unexpected areas, including algebraic number theory, to construct a counterexample that beat the constructions mathematicians had long accepted.[1][4]

The mathematical community's reaction has been unprecedented. Fields Medalist Tim Gowers called the result a milestone in AI mathematics, positioning it as one of the first clear cases of an AI independently solving a famous open mathematical problem.[1]

The mathematical community's reaction has been unprecedented.

Mathematician Daniel Litt noted that it is the most unique interesting result produced autonomously by AI so far, and the first AI proof that would likely be publishable in a top-tier math journal even if humans had produced it alone. The fact that the AI crossed disciplinary boundaries to find its solution is what makes the achievement so remarkable to veterans of the field.[1][3]

DeepMind's Aletheia achieved similar milestones in its own right. Unlike Olympiad problems, research-level mathematics requires navigating vast literature and avoiding the hallucinations that plague standard foundation models. To overcome this, Aletheia employs a natural language verifier that identifies flaws in candidate solutions, enabling an iterative process of generating, verifying, and revising.[2]

Crucially, the agent can admit failure, a feature that improves efficiency when collaborating with human researchers. Aletheia's capabilities extend far beyond pure combinatorics. It recently extended a theorem in economic theory—the Revelation Principle for auctioning AI generation tokens—by using advanced topology and order theory to accommodate continuous real numbers, invalidating the original proof that only worked for rational numbers.[2]

AI performance has scaled rapidly from high school Olympiad problems to PhD-level research exercises.

It also found novel analytical solutions to tricky integrals containing singularities, which are necessary for calculating gravitational radiation from cosmic strings in physics. These cross-disciplinary applications prove that the underlying reasoning engine is generalizing across the hard sciences.[2]

Despite these successes, the question of whether AI is truly creative remains a subject of intense debate among experts. The First Proof initiative, launched by the National Museum of Mathematics and featuring prominent mathematicians like Manjul Bhargava and Alex Kontorovich, aims to evaluate whether these systems are genuinely engaging in discovery or merely performing sophisticated combinatorial search.[5]

As Lauren Williams, a MacArthur Fellow and panelist for the initiative, noted, AI can sometimes produce brilliant solutions, but other times it generates answers that are nowhere near correct. The boundary between creative mathematical insight and brute-force computation is becoming increasingly blurred, forcing the community to redefine what it means to do mathematics.[5][6]

Mathematicians are increasingly treating AI as a scientific collaborator rather than just a computational tool.

Furthermore, the originality of these AI-generated proofs is under intense scrutiny. If a model merely surfaces hidden but existing answers by connecting vast amounts of literature, its utility is immense but fundamentally different from human intuition. However, if AI models can assemble partially known ideas into something substantially new—as seen in the application of algebraic number theory to a geometric problem—they cross the threshold into true scientific collaboration.[1][4][6]

The implications for the future of mathematics are profound. AI is transitioning from a calculator to a co-author. As these models scale, they are expected to tackle even more abstract fields, such as algebraic geometry and topology, where human intuition often hits its limits. For now, the consensus is clear: AI has not solved math, nor has it replaced the human mathematician. But it has undeniably become a powerful scientific companion, capable of breaking century-old deadlocks and expanding the frontier of human knowledge.[2][5][6]

How we got here

1946
Paul Erdős poses the planar unit distance problem, which remains unsolved for decades.
July 2025
AI models reach Gold-medal performance at the International Mathematical Olympiad, solving known problems.
February 2026
Google DeepMind's Aletheia agent autonomously solves open questions from the Erdős Conjectures database.
May 2026
OpenAI's reasoning model disproves a central conjecture tied to the Erdős planar unit distance problem.

Viewpoints in depth

AI Capabilities Advocates

Argue that AI is transitioning from a computational tool to a genuine scientific collaborator capable of original discovery.

Researchers at organizations like Google DeepMind and OpenAI view these milestones as proof that foundation models can synthesize vast literature and apply techniques across disparate fields. By combining natural language verifiers with reinforcement learning, they believe AI can overcome hallucinations and tackle increasingly abstract mathematical domains. They point to the AI's ability to cross disciplinary boundaries—such as using algebraic number theory for a geometric problem—as evidence of emergent creativity.

Mathematical Purists

Emphasize the need for rigorous evaluation to distinguish between true mathematical creativity and sophisticated pattern matching.

Initiatives like 'First Proof' highlight that while AI can produce brilliant solutions, it also generates nonsensical answers that require human experts to untangle. Mathematicians argue that true discovery involves not just finding a proof, but understanding why it matters and how it connects to the broader mathematical universe. They caution against equating massive combinatorial search with human intuition, stressing that AI is currently best utilized as an advanced assistant rather than an independent researcher.

What we don't know

Whether AI models can generalize their success in combinatorics and discrete geometry to highly abstract fields like algebraic geometry.
How the mathematical community will formally credit AI systems in peer-reviewed journals moving forward.

Key terms

Discrete Geometry: A branch of mathematics that studies combinatorial properties and constructive methods of discrete geometric objects, like points and lines.
Erdős Planar Unit Distance Problem: A famous open question asking for the maximum number of pairs of points in a plane that can be exactly one unit distance apart.
Combinatorics: The mathematics of counting, arranging, and finding patterns in complex sets.
Algebraic Number Theory: A major branch of number theory that studies algebraic numbers and their properties, which the AI unexpectedly used to solve a geometric problem.

Frequently asked

Has AI completely solved mathematics?

No. While AI has made breakthroughs in specific areas like discrete geometry, it still struggles with highly abstract fields and sometimes produces incorrect solutions that require human verification.

What makes this breakthrough different from AI passing math tests?

Previous AI models solved known problems with established answers. The 2026 breakthroughs represent AI generating novel proofs for open research problems that humans had not yet solved.

Can AI write its own proofs without human help?

Yes, in some cases. DeepMind's Aletheia and OpenAI's models have autonomously generated proofs that were later verified by human experts, though human-AI collaboration remains the most efficient approach.

Sources

[1]ForbesAI Capabilities Advocates
The AI Breakthrough That Has Mathematicians Paying Attention
Read on Forbes →
[2]Google DeepMindAI Capabilities Advocates
Accelerating Mathematical and Scientific Discovery with Gemini Deep Think
Read on Google DeepMind →
[3]Scientific AmericanMathematical Purists
AI Disproves Decades-Old Math Conjecture
Read on Scientific American →
[4]The New York TimesMathematical Purists
In a First, AI Discovers a Novel Mathematical Proof
Read on The New York Times →
[5]National Museum of MathematicsMathematical Purists
First Proof: Mathematicians Putting AI to the Test
Read on National Museum of Mathematics →
[6]Factlen Editorial TeamEditorial Synthesis
Synthesis by Factlen editorial team
Read on Factlen Editorial Team →

Stay informed

Every angle. Every day.

Get science stories with full source coverage and perspective breakdowns delivered to your inbox.

Get the briefing →Browse science