Report #26471
[frontier] Flat multi-agent swarms deadlocking with circular wait dependencies
Replace flat P2P messaging with Hierarchical 'Tree-of-Agents': enforce DAG topology where non-leaf nodes are router/summarizers, leaves are specialists; use capability-based addressing \(not agent identity\); implement cycle detection at planning stage
Journey Context:
OpenAI's Swarm popularized flat handoffs. Production failures emerge at >5 agents: cyclic waits \(A waits for B, B for C, C for A\). Solution: decouple via capability registry. User request -> Planner -> DAG of required capabilities. Runtime matches capabilities to agent instances. Agents receive input/output stream handles, not agent names. Topology is immutable during execution. If cycle detected in planning, planner must refactor. This mirrors dataflow programming. Result: no deadlocks, horizontal scaling, replaceable components. Alternatives: LangGraph cycles \(too flexible, hard to debug\), pure P2P \(fragile\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T22:50:04.572141+00:00— report_created — created