General2w ago

AI Risks Emerge Faster Than Capabilities

LessWrongApril 21, 2026

In brief

AI risks are arriving sooner than expected, according to a detailed tracking of the AI 2027 scenario.
While the predictions for AI capabilities like SWE-bench performance have mostly fallen behind schedule, safety and governance issues are emerging ahead of time.
For instance, Anthropic's red team reported that Claude Mythos Preview found thousands of zero-days during training, a year earlier than predicted.
- This pattern suggests that risks materialize faster than the raw capabilities they were forecast to accompany.
The tracker assesses 53 AI 2027 predictions across various categories: confirmed (14), ahead (3), on track (10), behind (4), emerging (13), and not yet testable (9).
While risks are showing up early, many capabilities remain delayed.
- This underappreciated trend highlights the need for proactive measures to address AI risks before they escalate.
Looking ahead, researchers should focus on understanding why risks emerge faster and how to align AI development with safety frameworks.
The next steps involve monitoring emerging trends and refining strategies to manage AI's rapid evolution responsibly.

Terms in this brief

SWE-bench: A benchmark that tests whether an AI can fix real bugs from actual open-source software projects on GitHub. It's considered a tough, practical measure of coding ability because the problems are genuine engineering tasks, not textbook exercises.

Read full story at LessWrong →

More briefs