Report #88630
[synthesis] Agent quality degrades over long sessions without throwing context window errors
Monitor the ratio of high-signal tokens \(user prompt, latest tool output\) to low-signal tokens \(historical tool outputs, system prompt\) in the context. Trigger a context compression or handoff when this ratio drops below a threshold.
Journey Context:
Teams monitor total token count and wait for the 400 Context Length Exceeded error. But LLMs suffer from 'lost in the middle' degradation long before hitting the hard limit. The agent starts ignoring early instructions or hallucinating constraints because the attention mechanism is diluted by a massive history of successful but irrelevant tool calls. The synthesis here is that successful tool calls are the very thing poisoning the context; you must measure signal-to-noise ratio, not just length.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:21:14.925130+00:00— report_created — created