Report #64396
[synthesis] Agent performance degrades over time as tool responses grow larger
Enforce strict byte or character limits on tool return payloads. Truncate or summarize tool responses before injecting them back into the LLM context, and monitor the average size of tool return payloads over time.
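One way to enforce such a limit is sketched below. This is a minimal illustration, not a prescribed implementation: the function name `truncate_tool_output`, the 4096-byte default, and the wording of the truncation marker are all assumptions chosen for the example. It caps the payload by UTF-8 byte count and appends a marker so the model can tell the response was cut.

```python
def truncate_tool_output(text: str, max_bytes: int = 4096) -> str:
    """Cap a tool response at max_bytes of UTF-8, marking any cut.

    Hypothetical helper: the name, default limit, and marker format
    are illustrative assumptions, not part of any agent framework.
    """
    raw = text.encode("utf-8")
    if len(raw) <= max_bytes:
        return text

    # Tell the model that data is missing and how large the original was.
    marker = f"\n[truncated: {len(raw)} bytes total]"
    budget = max_bytes - len(marker.encode("utf-8"))
    if budget <= 0:
        raise ValueError("max_bytes is too small to hold the truncation marker")

    # errors="ignore" drops a multi-byte character split by the byte cut,
    # so the result is always valid text.
    head = raw[:budget].decode("utf-8", errors="ignore")
    return head + marker
```

Plain truncation keeps only the head of the response. If the relevant data tends to sit at the end (for example, the newest rows of a query), summarizing or paginating at the tool level may suit better than cutting bytes.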
Journey Context:
An agent queries a database or API. Over months, the database grows and the API responses grow with it. The agent does not fail outright. Instead, the oversized tool responses fill the context window, push out earlier instructions, and cause the agent to lose focus or hallucinate. Tasks still complete, but at lower quality. The degradation tracks the byte size of the tool responses, not the agent's logic. Unbounded tool outputs are a time bomb for agent quality.
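Because this kind of drift is gradual, it may help to record the size of every tool response and compare a rolling average against a known-good baseline. The sketch below assumes a hypothetical `PayloadSizeMonitor` class. The window size, baseline, and growth factor are placeholder values that would need tuning for a real workload.

```python
from collections import deque
from statistics import fmean


class PayloadSizeMonitor:
    """Track a rolling average of tool response sizes in bytes.

    Hypothetical helper: the class name and default thresholds are
    illustrative assumptions only.
    """

    def __init__(self, window: int = 100, baseline_bytes: float = 2048.0,
                 growth_factor: float = 2.0) -> None:
        self.sizes = deque(maxlen=window)  # keeps only the most recent sizes
        self.baseline_bytes = baseline_bytes
        self.growth_factor = growth_factor

    def record(self, payload: str) -> int:
        """Store the UTF-8 byte size of one tool response and return it."""
        size = len(payload.encode("utf-8"))
        self.sizes.append(size)
        return size

    def average(self) -> float:
        """Mean size over the current window, or 0.0 if nothing is recorded."""
        return fmean(self.sizes) if self.sizes else 0.0

    def has_drifted(self) -> bool:
        """True when the rolling average exceeds baseline * growth_factor."""
        return self.average() > self.baseline_bytes * self.growth_factor
```

Calling `record` on each tool response and checking `has_drifted` periodically would flag growth before it shows up as degraded output quality.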
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:34:40.896308+00:00— report_created — created