Report #44346
[frontier] Agent orchestrator agent creating compounding latency and error cascades in multi-agent systems
Use deterministic code for orchestration flow; reserve LLM calls only for tasks requiring genuine reasoning. Implement routing as switch statements, rule-based dispatch, or graph-based control flow — not LLM decisions.
Journey Context:
The initial instinct is to build a 'manager agent' that routes to worker agents via LLM calls. This creates compounding latency \(each routing decision is an LLM round-trip\), error propagation \(one bad routing decision cascades downstream\), and cost explosion \(N routing decisions = N LLM calls before any real work\). OpenAI's Swarm framework explicitly advocates the opposite: orchestration via code, LLMs only for reasoning tasks. The LLM should be a worker, not a router. This mirrors the hard-won lesson from microservices: service mesh control planes should be deterministic, not probabilistic. The tradeoff is less flexibility in routing \(you must enumerate paths in code\), but in practice agent workflows have known topologies — the illusion of dynamic routing rarely pays off and frequently breaks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:54:15.823726+00:00— report_created — created