We scan new podcasts and send you the top 5 insights daily.
Moving beyond solving existing problems like the Millennium Prize problems, the true test of advanced AI in mathematics will be its ability to generate novel, interesting conjectures and create new, unifying definitions. This represents a higher tier of mathematical creativity, akin to the work of the greatest mathematicians who frame the questions for others to solve.
Generative AI can produce the "miraculous" insights needed for formal proofs, like finding an inductive invariant, which traditionally required a PhD. It achieves this by training on vast libraries of existing mathematical proofs and generalizing their underlying patterns, effectively automating the creative leap needed for verification.
An OpenAI model, without any specific mathematical training, solved a famous 80-year-old math problem. This proves general-purpose AI can autonomously produce landmark scientific results, not just accelerate human research. It signals a new era for discovery where AI is a primary research agent.
Hassabis argues AGI isn't just about solving existing problems. True AGI must demonstrate the capacity for breakthrough creativity, like Einstein developing a new theory of physics or Picasso creating a new art genre. This sets a much higher bar than current systems.
As AIs automate theorem proving and even explanation, the role of human mathematicians will shift. Instead of being creators, they will act as curators, using their taste and social connection to guide others through the vast, AI-generated landscape of mathematical ideas. Their value will lie in providing motivation and a human-centric narrative.
Unlike other sciences, mathematics has historically lacked a strong experimental branch. AI changes this by enabling large-scale studies—for example, testing a thousand different problem-solving approaches on a thousand problems. This creates a new, data-driven methodology for a field that has been almost entirely theoretical.
Future AI-driven mathematical discoveries will likely follow two paths. One is finding 'lightning bolt' connections between existing, disparate fields (e.g., number theory and physics). The other, more profound path, is 'mountain building'—constructing entirely new theoretical frameworks, a skill signifying a much higher level of general intelligence.
Proving theorems is only part of math. Axiom is developing tools for the pre-conjecture phase, helping mathematicians find interesting examples and constructions (like graphs or sequences). This AI-assisted discovery builds the intuition necessary before a formal proof can even be attempted.
Like DeepMind's AlphaFold, which predicted millions of protein structures to fill gaps in the proteome, mathematical AI will systematically solve known conjectures. This creates a vast, verified library of mathematical knowledge, which in turn becomes a more powerful foundation for solving even harder problems in a recursive, self-improving loop.
We have formal languages like Lean for deductive proofs, which AI can be trained on. The next frontier is developing a language to capture mathematical *strategy*—how to assess a conjecture's plausibility or choose a promising path. This would help automate the intuitive, creative part of mathematical discovery.
We perceive complex math as a pinnacle of intelligence, but for AI, it may be an easier problem than tasks we find trivial. Like chess, which computers mastered decades ago, solving major math problems might not signify human-level reasoning but rather that the domain is surprisingly susceptible to computational approaches.