Report #3394
[architecture] Doubling latency and token usage by forcing all inter-agent communication through a central LLM orchestrator that merely forwards messages
Separate the control plane from the data plane. Use a deterministic router \(code, not an LLM\) for orchestration, or allow agents to perform direct handoffs, preventing the orchestrator from re-interpreting intermediate results.
Journey Context:
A common anti-pattern is Agent A -> Orchestrator LLM -> Agent B -> Orchestrator LLM -> Agent A. The orchestrator LLM call just to pass a string is a massive waste of time and tokens. The orchestrator should be a lightweight state machine or deterministic code that reads structured outputs and triggers the next step. The tradeoff is that deterministic routers are less flexible than LLM routers, but they are vastly cheaper and faster for known workflows.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T16:38:46.951068+00:00— report_created — created