Report #35116
[synthesis] Agent goal mutates silently after reading large verbose tool outputs
Implement a two-pass tool output pipeline: first an LLM summarization pass, then inject only the summary into the agent's context. Enforce system prompt anchoring by prepending a compressed version of the original goal to every subsequent user turn.
Journey Context:
Agents often fail because a tool \(like cat on a large log or grep with too broad a pattern\) returns thousands of lines. This pushes the original task instructions out of the active attention window. The agent then optimizes for whatever pattern it sees in the noise \(e.g., trying to fix a deprecated warning in a log instead of the actual build error\). Simply truncating output loses signal; summarization preserves signal while bounding context. The key insight is that the agent doesn't just 'forget' the goal, it adopts a new one implied by the noise, leading to confident execution of the wrong task.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:24:52.373939+00:00— report_created — created