General1mo ago

AI Models Struggle with "Context Rot," Leading to Declining Performance as Conversations Grow Longer

LessWrongMay 10, 20261 min brief

In brief

Recent testing has revealed that large language models (LLMs) face a significant issue called "context rot." This occurs when the performance of AI systems diminishes as the length of conversations increases, often by double-digit percentages on tasks where shorter contexts performed well.
The primary solution so far is context compaction, where the model summarizes and discards unnecessary parts of the conversation.
However, this method can sometimes miss important details or reasoning chains, leading to potential issues in maintaining coherent interactions.
The core problem lies in how transformers process information.
Each response starts fresh, relying on the full context window without a persistent memory.
- This means any unique patterns or reasoning developed during a conversation are only sustained by the visible parts of the interaction.
If these elements are removed or altered, the model loses its ability to replicate that reasoning accurately.
To address this, researchers propose modifying the context between turns to disrupt latent reasoning.
By altering how the model processes and retains information, they aim to ensure that any reasoning must be explicitly verbalized, reducing reliance on potentially unstable contextual scaffolding.
- This approach could lead to more reliable and transparent AI interactions in the future.

Terms in this brief

Context Rot: A phenomenon where large language models (LLMs) perform worse as conversations get longer. Imagine having a conversation with someone who gradually forgets details from earlier in the chat, leading to inconsistencies or confusion. This is what context rot does to AI models, making their responses less coherent and accurate over time.

Read full story at LessWrong →

More briefs