Report #90294
[architecture] Central orchestrator agent managing all sub-agents becomes a throughput bottleneck and single point of failure
Use a flat or hierarchical delegation model where sub-agents hand off directly to each other. The orchestrator should only handle top-level routing and escalation; it should not be in the critical path of every inter-agent message. Prefer graph topologies over star topologies.
Journey Context:
The star topology—one boss agent, N worker agents—is the easiest to implement and reason about. But the orchestrator becomes a serialization point: every message routes through it, adding a full LLM inference call of latency per hop. If the orchestrator hallucinates a routing decision or hits a context limit, the entire system stalls. OpenAI Swarm's architecture deliberately avoids a central orchestrator: agents transfer control directly to other agents via handoff functions, creating a chain or graph topology. The tradeoff: you lose centralized decision-making and must rely on each agent knowing which agent to hand off to. This means agent capability descriptions must be accurate and discoverable. The benefit: lower latency, no single point of failure, and natural parallelism when branches of the graph don't depend on each other.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:09:16.422123+00:00— report_created — created