Report #40388
[architecture] Quadratic token cost and context window exhaustion in multi-agent group chats
Implement a bounded, structured shared memory store (a scratchpad) that agents read from and write to, rather than appending the full conversational history to every agent's prompt.
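A minimal sketch of such a scratchpad, assuming a keyed store with a fixed entry count and a per-entry size cap. The names `Scratchpad`, `build_prompt`, `max_entries`, and `max_chars_per_entry` are hypothetical and do not come from any particular agent framework; the oldest-first eviction policy is also an assumption, chosen only for simplicity.

```python
from collections import OrderedDict


class Scratchpad:
    """Bounded key-value shared memory (hypothetical illustration)."""

    def __init__(self, max_entries=8, max_chars_per_entry=200):
        self.max_entries = max_entries
        self.max_chars_per_entry = max_chars_per_entry
        self._entries = OrderedDict()  # key -> (author, text)

    def write(self, author, key, text):
        # Truncate oversized notes so one agent cannot blow up the store.
        text = text[: self.max_chars_per_entry]
        # Re-writing a key refreshes it and moves it to the newest position.
        self._entries.pop(key, None)
        self._entries[key] = (author, text)
        # Evict the oldest entries once the bound is exceeded.
        while len(self._entries) > self.max_entries:
            self._entries.popitem(last=False)

    def read(self, keys):
        # Return only the requested keys that are still present.
        return {k: self._entries[k][1] for k in keys if k in self._entries}


def build_prompt(task, pad, keys):
    """Inject only the selected scratchpad entries, not the chat history."""
    notes = pad.read(keys)
    lines = [f"- {k}: {v}" for k, v in notes.items()]
    return "Task: " + task + "\nShared notes:\n" + "\n".join(lines)
```

Because both the number of entries and the size of each entry are capped, the prompt built for any agent has a fixed upper bound regardless of how long the conversation has run.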
Journey Context:
In naive multi-agent setups (such as a group chat), every message from one agent is appended to the context of every other agent. Each prompt therefore grows with the length of the conversation, so cumulative token usage grows quadratically with the number of turns and context limits are exhausted quickly. With a shared external memory, and only the minimal required context injected per turn, each prompt stays bounded and total token usage grows linearly, which prevents cascading context window overflows.
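The difference in growth can be illustrated with a rough back-of-the-envelope model. This sketch assumes one agent speaks per turn, every message has the same size, and the full-history prompt at each turn contains all prior messages; system prompts, output tokens, and any provider-side caching are ignored. The function names and the numbers used are hypothetical.

```python
def full_history_prompt_tokens(turns, tokens_per_message):
    """Cumulative prompt tokens when each turn re-sends all prior messages."""
    total = 0
    for turn in range(turns):
        # At turn index `turn`, the prompt holds `turn` earlier messages.
        total += turn * tokens_per_message
    return total


def scratchpad_prompt_tokens(turns, budget_per_turn):
    """Cumulative prompt tokens when each turn injects a fixed budget."""
    return turns * budget_per_turn


naive = full_history_prompt_tokens(turns=100, tokens_per_message=200)
bounded = scratchpad_prompt_tokens(turns=100, budget_per_turn=1000)
print(naive, bounded)  # 990000 100000
```

Under these assumptions the full-history total is tokens_per_message * turns * (turns - 1) / 2, which is quadratic in the number of turns, while the scratchpad total is linear. The per-prompt size matters as well: the last full-history prompt here is 19,800 tokens and keeps growing, whereas the scratchpad prompt never exceeds its budget.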
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:15:47.089393+00:00— report_created — created