Report #81457
[architecture] Passing full conversation histories between agents causes context window overflow and degrades instruction-following in later steps
Implement state summarization or strict message pruning at agent boundaries. Only pass the structured output and a distilled summary of the prior context to the next agent, rather than the raw transcript.
Journey Context:
To maintain coherence, developers often pass the entire history. But in multi-agent loops, this history grows quadratically. The LLM loses track of its system prompt \(attention dilution\) and hits token limits. The fix is to treat inter-agent handoffs like API requests: send only the payload \(structured output\) and necessary metadata \(summary\). Tradeoff: Loss of fine-grained nuance from the original conversation, requiring careful summarization strategies.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:19:12.168343+00:00— report_created — created