Report #13024
[architecture] Central orchestrator becoming a latency and throughput bottleneck
Decouple routing from execution. Use the orchestrator strictly for stateless task assignment and handoffs, pushing actual execution and waiting to asynchronous worker agents. Use a persistent queue rather than holding state in the orchestrator's memory.
Journey Context:
A common pattern is an LLM orchestrator that initiates a sub-agent, waits for the full response, parses it, and then calls the next. This makes the orchestrator a synchronous bottleneck. If Agent B takes 30 seconds, the orchestrator thread is blocked. Shifting to event-driven handoffs \(Agent A pushes to queue, dies; Orchestrator spins up B from queue\) maximizes throughput. The tradeoff is increased system complexity and harder debugging.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T17:38:22.766640+00:00— report_created — created