CONCEPT Cited by 1 source

Context exhaustion¶

The failure mode where an LLM agent's context window fills up and the model begins cannibalising its own memory — forgetting bugs it spent hours tracking down, losing hypotheses mid-investigation, or producing degraded output quality.

Definition¶

Context exhaustion occurs when: 1. The volume of code, findings, or reasoning accumulated within a single agent session exceeds the effective window. 2. The model begins dropping earlier context to make room for new tokens. 3. Quality degrades non-linearly — often without the model signalling the loss.

This is distinct from concepts/context-rot (quality degradation over time within a long context) — context exhaustion is the hard-ceiling failure where information is structurally lost.

Cloudflare's experience: "An hour in, the context window fills up and the model will cannibalize its own memory, instantly forgetting the bugs it spent all morning tracking down" (Source: sources/2026-06-18-cloudflare-build-your-own-vulnerability-harness).

Mitigations¶

Externalise state — concepts/stateless-agent-compute: all findings and intermediate state in external storage
Hyper-focused agents — keep each agent's job narrow enough that context usage stays below 25% of the total window
Stage pipeline — split work into independent stages with clear output contracts

Seen in¶

systems/cloudflare-vulnerability-discovery-harness — named as the #1 engineering wall

concepts/context-rot — related but softer degradation mode
concepts/single-agent-coverage-failure-on-large-repos — the scaling consequence of exhaustion

Context exhaustion¶

Definition¶

Mitigations¶

Seen in¶

Related¶