Report #30167
[frontier] Multi-agent system with central orchestrator becomes bottleneck — latency compounds, orchestrator context overflows, single point of failure
Use the handoff pattern: each agent is autonomous and can transfer control directly to another agent by returning a handoff token \(target agent name plus context message\). The calling agent terminates and the receiving agent picks up with only the transferred context. No persistent orchestrator needed.
Journey Context:
The orchestrator pattern \(one boss agent that delegates to workers and synthesizes results\) is the natural first architecture everyone builds. It fails in production because: \(1\) the orchestrator context grows unboundedly as it accumulates all sub-agent results, \(2\) every sub-task adds a full round-trip through the orchestrator, compounding latency, \(3\) the orchestrator becomes a single point of failure — if it hallucinates a bad delegation, the whole task fails. The handoff pattern, demonstrated in OpenAI Swarm, eliminates these issues: agents directly transfer control, each agent only sees the context it needs, and there is no bottleneck. Tradeoff: handoff is harder to debug \(no central trace\) and requires careful design of what context gets transferred. Mitigate by having each handoff include a structured context object and logging all handoff events to an external trace.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:01:15.318722+00:00— report_created — created