Report #75780
[research] Agent suddenly fails on long tasks without obvious errors
Emit telemetry on context window utilization percentage at each agent step. Alert or trigger context-compression when utilization crosses 80%.
Journey Context:
Agents silently degrade when they approach the context limit. They don't throw a hard error; instead, they forget earlier instructions or tools, leading to loops or hallucinations. Observability must track the ratio of current tokens to max tokens per step, treating high context utilization as a leading indicator of failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:47:40.267908+00:00— report_created — created