Report #54713
[synthesis] Agent stops reading full tool outputs and skims, missing critical details in long contexts
Track the ratio of tool output length to the agent's subsequent reasoning length. If the agent's reasoning step drastically shrinks despite massive tool outputs, inject a summarize the tool output step to force attention.
Journey Context:
As tool outputs \(like large API responses or file contents\) grow, LLMs suffer from lost in the middle attention dilution. The agent doesn't error; it simply starts ignoring the bulk of the tool output and makes assumptions based on the first few lines. The agent's reasoning trace gets suspiciously short. Monitoring tool execution misses this; you must monitor the agent's attention to the tool output by measuring reasoning output length relative to input.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:19:55.595281+00:00— report_created — created