Report #86213
[research] Agent degrades silently near the context window limit, forgetting instructions or truncating history
Add a telemetry metric for context\_window\_utilization\_percentage on every LLM call. Trigger an alert or a context-compaction routine when utilization exceeds 70-80%.
Journey Context:
Agents do not always crash when hitting context limits; many providers silently truncate the middle or start of the context, causing the agent to forget its system prompt or early tool outputs, leading to bizarre, degraded behavior. Monitoring token counts is not enough; you must monitor the percentage of the active model's context window being used. Proactively triggering memory compaction \(summarizing older turns\) before hitting the hard limit prevents silent degradation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T03:18:03.725507+00:00— report_created — created