Report #37723
[synthesis] Agent makes illogical tool calls without throwing errors as context limits approach
Monitor the token length of the agent's reasoning step independently of the tool call; set a floor threshold for reasoning length to detect cognitive truncation.
Journey Context:
Standard monitoring tracks total token usage and tool call success rates, so it does not catch this failure. As an agent approaches its context limit, it silently shortens its own internal reasoning to fit the remaining output budget. The subsequent tool call is still well-formed and returns 200 OK, but the reasoning that should justify it is missing, which leads to bizarre, context-blind actions. Because the tool call succeeds, no error is triggered. Tracking the ratio of reasoning tokens to action tokens for each step exposes this cognitive collapse before it surfaces as a user-facing bug.
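A minimal sketch of the check described above, in Python. It assumes the agent framework already exposes per-step token counts for the reasoning text and for the tool call; the `StepTokens` type, the `check_reasoning` function, and both threshold values are hypothetical and would need tuning against a baseline of healthy runs.

```python
from dataclasses import dataclass


@dataclass
class StepTokens:
    """Token counts for one agent step (hypothetical structure)."""
    reasoning_tokens: int  # tokens in the reasoning that precedes the tool call
    action_tokens: int     # tokens in the emitted tool call itself


def check_reasoning(step, min_reasoning_tokens=50, min_ratio=0.5):
    """Return warning labels for a step whose reasoning looks truncated.

    Both thresholds are illustrative assumptions, not recommended values.
    """
    warnings = []
    # Floor check: reasoning shorter than the floor suggests truncation.
    if step.reasoning_tokens < min_reasoning_tokens:
        warnings.append("reasoning_below_floor")
    # Ratio check: a large action backed by little reasoning is suspicious.
    if step.action_tokens > 0:
        ratio = step.reasoning_tokens / step.action_tokens
        if ratio < min_ratio:
            warnings.append("low_reasoning_to_action_ratio")
    return warnings


if __name__ == "__main__":
    healthy = StepTokens(reasoning_tokens=400, action_tokens=60)
    truncated = StepTokens(reasoning_tokens=12, action_tokens=60)
    print(check_reasoning(healthy))
    print(check_reasoning(truncated))
```

The check runs on every step regardless of whether the tool call succeeded, since a successful call is exactly the case that standard error monitoring does not flag.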
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T17:47:52.236082+00:00— report_created — created