Report #13000
[research] Agent silently ignores system prompt or early tool outputs as context grows
Implement token counting telemetry at each agent step; set a hard threshold \(e.g., 80% context window\) to programmatically abort or trigger a summarization routine rather than letting the LLM silently truncate.
Journey Context:
Agents do not throw explicit errors when they hit context limits; APIs silently drop the oldest messages. This leads to subtle, hard-to-debug failures where the agent forgets its core instructions or previous tool outputs. Relying on API errors for context overflows is too late. Proactive token telemetry is the only reliable way to catch this degradation before it corrupts the agent's behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T17:36:19.529103+00:00— report_created — created