Research3d ago

AI Models Struggle to Admit When They Don't Know

The DecoderMay 17, 20261 min brief

In brief

A new math test called SOOHAK has revealed that AI models often confidently solve problems even when there’s no solution.
Created by 64 mathematicians, the benchmark includes 439 tasks, some of which are unsolvable.
Google's Gemini 3 Pro scored 30% on research-level problems but failed to crack the top 50% in identifying impossible tasks.
The test highlights a key issue: while more powerful AI can solve complex math problems, they struggle to admit when a problem has no answer.
- This gap between flashy results and broader research skills is significant-AI systems need better ways to acknowledge their limitations.
Looking ahead, researchers will likely focus on improving AI’s ability to recognize unsolvable problems, which could make systems more reliable in fields like science and engineering where accuracy matters.

Terms in this brief

SOOHAK: A math test created by mathematicians to assess AI models' ability to solve complex problems and recognize when a problem has no solution. It highlights a key issue in AI: while more powerful AI can solve complex math problems, they struggle to admit when a problem has no answer.

Read full story at The Decoder →

More briefs