Report #40737
[architecture] Passing entire conversation history between agents causes context window overflow and cost spikes
Summarize or extract only the strictly necessary state variables (context distillation) when handing off between agents, rather than passing the raw conversational transcript.
Journey Context:
Naive multi-agent systems append Agent A's entire chat history to Agent B's input. This quickly hits token limits, increases latency and cost, and confuses Agent B with irrelevant conversational turns. Agent B needs only the outcome of Agent A's work and the current goal. Passing raw history is an anti-pattern; treat inter-agent communication like an API contract, not a chat log.
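One way to picture the "API contract" idea is a small typed handoff object. The sketch below is illustrative only: the names `Handoff`, `distill`, and `render_for_agent_b` are hypothetical and not tied to any agent framework, and it assumes Agent A already keeps its results in a plain dictionary. A real system might instead generate the outcome summary with a model call.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass(frozen=True)
class Handoff:
    """Hypothetical contract describing what Agent B receives from Agent A."""
    goal: str      # what Agent B should accomplish next
    outcome: str   # short summary of Agent A's result
    state: dict = field(default_factory=dict)  # only the variables Agent B needs


def distill(agent_a_state: dict, goal: str, outcome: str,
            required_keys: list[str]) -> Handoff:
    """Copy only whitelisted state variables; the raw transcript is left behind."""
    missing = [k for k in required_keys if k not in agent_a_state]
    if missing:
        raise KeyError(f"Agent A did not produce required state: {missing}")
    return Handoff(
        goal=goal,
        outcome=outcome,
        state={k: agent_a_state[k] for k in required_keys},
    )


def render_for_agent_b(handoff: Handoff) -> str:
    """Serialize the handoff as compact JSON to place in Agent B's input."""
    return json.dumps(asdict(handoff), sort_keys=True)


# Example: Agent A's working state includes a transcript that Agent B never sees.
agent_a_state = {
    "transcript": ["user: my order arrived damaged", "agent: checking order A-1"],
    "order_id": "A-1",
    "total": 42,
}
handoff = distill(agent_a_state, "Issue refund", "Order validated",
                  required_keys=["order_id", "total"])
print(render_for_agent_b(handoff))
```

Whitelisting keys keeps the payload size independent of how long Agent A's conversation ran, and failing fast on a missing key surfaces a broken contract at the handoff rather than inside Agent B.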
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:50:56.658022+00:00— report_created — created