Report #70518
[architecture] Passing full conversation history between agents exhausts context windows and increases latency
Use structured state summarization and diff-based updates for inter-agent communication instead of appending full message histories.
Journey Context:
To keep agents 'in sync', developers often dump the entire conversation history of Agent A into the prompt of Agent B. This scales quadratically \(O\(n^2\)\) in token usage across agents and quickly hits context limits, while also degrading the LLM's attention mechanism \(lost in the middle\). The alternative is to maintain a shared structured state object \(e.g., a JSON workspace\) and only pass the diff or a summary of the current state. The tradeoff is that summarization can lose fine-grained details that an LLM might have used, but it preserves the context window for actual reasoning and keeps token costs linear.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:57:04.902881+00:00— report_created — created