The Edge of Mathematics - The Atlantic
Over the past couple of months, several researchers have begun making the same provocative claim: They used generative-AI tools to solve a previously unanswered math problem.
The most extreme promises—AI-assisted resolutions to some of the hardest problems in mathematics—may well turn out to be empty hype. But a number of AI-written solutions, albeit to far less lauded problems, have checked out. These were answers to a number of the Erdős Problems—more than 1,000 mathematical questions set forth by the Hungarian mathematician Paul Erdős—written with generative-AI models including ChatGPT. OpenAI quickly claimed a victory: “GPT-5.2 Pro for solving another open Erdős problem,” OpenAI President Greg Brockman posted on X in January. “Going to be a wild year for mathematical and scientific advancement!” (OpenAI and The Atlantic have a corporate partnership.)
Much of the excitement around the news has stemmed from the adjudicator of these AI-written proofs: Terence Tao, a professor at UCLA who is widely considered to be the world’s greatest living mathematician. His stamp of approval seemingly legitimizes the greatest promise of generative AI—to push the frontier of human knowledge and civilization. When I called Tao earlier this month to get his take on what AI can offer mathematics, he was more tempered. The AI-generated Erdős solutions are impressive, he told me, but not overwhelmingly so: The bots have functionally landed some “cheap wins,” Tao said.
Read: We’re entering uncharted territory for math
Tao has long been intrigued by, but reserved about, what AI tools can do for his field. The first time we spoke, in the fall of 2024, Tao had likened chatbots to “mediocre, but not completely incompetent” graduate students. About six months later, he told me the models had gotten better “at certain types of high-level math reasoning,” but lacked creativity and made subtle mistakes. But during our most recent conversation, he was more bullish. AI may not be on the cusp of solving all of the world’s great math problems, but chatbots are at the point where they can collaborate with human mathematicians. In the process, he said, the technology is opening up a different “way of doing mathematics.”
This conversation has been edited for length and clarity.<br>Matteo Wong: There has recently been a lot of excitement around ChatGPT’s ability to solve some Erdős Problems. How have you seen generative AI’s mathematical capabilities evolve over the past year or so?<br>Terence Tao: There’s a big crowd of people who really, really want AI success stories. And then there’s an equal and opposite crowd of people who want to dismiss all AI progress. And what we have is a very complicated and nuanced story in between.<br>In these Erdős Problems in particular, there’s a small core of high-profile problems that we really want to solve, and then there’s this long tail of very obscure problems. What AI has been very good at is systematically exploring this long tail and knocking off the easiest of the problems. But it’s very different from a human style. Humans would not systematically go through all 1,000 problems and pick the 12 easiest ones to work on, which is kind of what the AIs are doing.<br>There really is this massive scale of difficulty between these problems. And looking at the problems that AIs have solved by themselves so far, it’s like, Oh, okay, they were using a standard technique. If an expert had half a day to look into the matter, they would have worked it out too. There have been more sophisticated solutions, which are AI-assisted. I think in the short term we’re going to get a lot of quick wins on easy problems from pure AI methods. And then over the next few months, I think we’re going to have all kinds of hybrid, human-AI contributions.<br>I’m learning from some of the proofs that show up. I enjoy reading them—maybe it uses a trick from some paper from 1960 that I wasn’t aware of. So it may not be super, super creative, but it was new and it can do things that human experts looking at the problem dismissed.<br>Wong: You’ve written that when human mathematicians approach a new problem, regardless of whether they succeed, they produce insights that others in the field can build on—something AI-based proofs don’t provide. How come?<br>Tao: These problems are like distant locations that you would hike to. And in the past, you would have to go on a journey. You can lay down trail markers that other people could follow, and you could make maps.<br>AI tools are like taking a helicopter to drop you off at the site. You miss all the benefits of the journey itself. You just get right to the destination, which actually was only just a part of the value of solving these problems.<br>Wong: When you think about the abilities of these models today, what can they contribute to your field in addition to enabling nonmathematicians to tackle more advanced problems?<br>Tao: Today there are a lot of very tedious types...