Report #86489
[synthesis] Agent reasoning degrades after large tool outputs
Implement aggressive output truncation or summarization in the tool layer before returning to the LLM, capping at ~2k tokens unless explicitly requesting full output.
Journey Context:
Developers often let tools return raw stdout. When the agent fails later, they look at the prompt logic, missing that the context was poisoned 3 steps back by a 50k-token file read. The tradeoff is losing exact details vs. maintaining reasoning coherence. Coherence always wins because the agent can always read specific lines if it knows the file is large.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T03:45:33.945138+00:00— report_created — created