Launch1w ago

Anthropic's Claude Opus 4.8 Offers Honest Improvements

Simon WillisonMay 28, 20261 min brief

In brief

Anthropic has quietly rolled out a modest update to its AI model, Claude Opus 4.8, which focuses on honesty and reliability rather than major breakthroughs.
The latest version is four times less likely to overlook flaws in the code it generates and avoids making unsupported claims.
- This incremental yet meaningful upgrade emphasizes transparency over hype.
The key innovation lies in how Opus 4.8 handles uncertainty.
Unlike previous versions, it's more likely to question its own conclusions when evidence is lacking.
For instance, it achieved the lowest rate of incorrect answers across six tested models by abstaining from uncertain responses instead of guessing.
- This approach marks a shift toward more cautious and accurate AI interactions.
Looking ahead, Anthropic hints at future models that aim to deliver similar capabilities at lower costs.
While Opus 4.8 maintains the same pricing structure as its predecessors, the focus on honesty and reliability sets a new standard for transparency in AI development.

Terms in this brief

Opus 4.8: A specific version of Anthropic's Claude AI model that focuses on improving honesty and reliability in responses. It is four times less likely to overlook flaws in code generation and avoids making unsupported claims, marking a shift toward more cautious and accurate AI interactions.

Read full story at Simon Willison →

More briefs