Over the weekend, news circulated through the math and AI communities that may mark a turning point in the human-machine relationship. Neel Somani, a software engineer and former quant researcher, tested the mathematical capabilities of OpenAI’s newest model—GPT-5.2—on the “Erdős problems,” a collection of over 1,000 conjectures posed by the Hungarian mathematician Paul Erdős that have long challenged scientists.
Somani input Problem #397—which questions the existence of infinitely many solutions for a specific equation involving central binomial coefficients—into the model. After approximately 15 minutes of “thinking,” GPT-5.2 generated a full proof that effectively disproved the conjecture by providing an infinite family of counterexamples.
Crucially, the process didn’t end with chatbot text. Somani utilized Harmonic’s Aristotle tool to formalize the proof in Lean, a code-based verification language. Only after this machine-verified proof was established did it reach Terence Tao, who confirmed its correctness. On the erdosproblems.com database, the status of Problem #397 was updated to “DISPROVED (LEAN).”

d-tags




