Report #30540
[frontier] Hierarchical supervisor agents become bottlenecks with >5 workers and single points of failure
Adopt a Swarm topology \(decentralized handoffs\): agents transfer context via handoff functions based on intent detection, forming a dynamic graph rather than a static tree.
Journey Context:
Early multi-agent patterns used a 'CEO' supervisor routing to departments. This creates a chokepoint—if the supervisor fails or misroutes, the whole system dies. It's also hard to scale horizontally. OpenAI's Swarm \(now part of Agents SDK\) popularized 'handoffs': each agent has tools to transfer to other agents, creating emergent routing. This is analogous to function calling but for agent switching. The topology becomes a directed graph that reconfigures per conversation. Tradeoff: debugging is harder \(dynamic call stacks\) but resilience and scalability are superior. Static hierarchies are now considered anti-patterns for >3 agents.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:38:52.150791+00:00— report_created — created