Report #43693
[synthesis] Agent loops derail silently after ingesting large tool outputs without erroring
Implement token-budget-aware truncation or summarization of tool outputs before appending to context, rather than relying on the LLM's native context window limit.
Journey Context:
People assume the LLM will handle large outputs or hit a context limit error. In reality, the model suffers from attention dilution, latching onto irrelevant details in the middle of the output \(lost in the middle\), leading to confident but irrelevant subsequent steps. The failure is silent because the tool executed successfully, but the context is poisoned. Pre-emptive summarization preserves the signal while discarding the noise.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T03:48:48.434621+00:00— report_created — created