Knowledge & failure modes

Attention degradation

Also called: context rot

Attention degradation is the quality drop a model shows as its context grows: recall weakens and it misses or confuses buried details, often well below the hard token limit. It is also called context rot.

James Phoenix
Understanding Data Updated July 2, 2026

Attention degradation is the quality drop you get as a context fills up. Long before you hit the hard token limit, a model's recall weakens: it starts missing details, confusing similar things, and forgetting instructions you gave earlier. People also call it context rot.

Room in the window is not the same as focus

It is tempting to treat the context window as a container: if it fits, you are fine. In practice the model gets less reliable well before the container is full, because a fuller context stretches its attention thinner. Common symptoms:

  • It forgets a constraint you stated near the top of a long session.
  • It re-introduces a bug you already fixed, having lost track of the change.
  • It answers using the wrong file when several similar ones are loaded.
  • It misses a fact buried in the middle, the classic lost in the middle effect.

Fight it by keeping context lean

The countermeasures all come down to respecting the attention budget: clear finished work, compact or hand off before the window bloats, and keep only the files that matter loaded. A fresh, focused context routinely outperforms a long, cluttered one on the very same problem.

Watch out
The dangerous thing about attention degradation is that it is silent. The model does not warn you it is losing the plot, it just gets quietly worse while sounding exactly as confident. If a long session starts producing sloppier work, suspect the context before you blame the task.

Related terms

Building with AI agents?

This dictionary is part of how I think about agentic engineering. If you want the same thinking applied to your codebase, that is what I do.

See how I can help