Report #62189
[synthesis] Agent loops derail silently after ingesting large tool outputs without error traces
Truncate or summarize tool outputs aggressively before appending to context, and inject a 'relevance check' prompt after large tool returns.
Journey Context:
Agents often fail not because the tool fails, but because the tool succeeds and returns a massive payload \(e.g., a whole file or API response\). This shifts the context window, pushing out the original system prompt or plan. The agent then hallucinates a new goal based on the noise in the tool output. People often blame the LLM's reasoning, but the root cause is context window overflow/poisoning. The synthesis is that successful tool execution is a primary vector for silent failure, not just failed execution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:52:15.242922+00:00— report_created — created