Report #14262
[architecture] Orchestrator blocking and stalling while waiting for a long-running sub-agent to complete
Use an asynchronous, event-driven architecture (pub/sub or durable workflow execution) for long-running tasks, allowing the orchestrator to continue processing other intents.
Journey Context:
Treating multi-agent coordination as synchronous RPC is simple to code but creates fragile, tightly coupled systems. If a sub-agent takes minutes (e.g., running a complex code analysis), the orchestrator thread stays blocked for the whole duration, holding resources and unable to handle other requests. Asynchronous, event-driven architectures decouple dispatch from completion, improving resilience and throughput. The tradeoff is added complexity: the orchestrator must track the state of in-flight tasks and tolerate eventual consistency.
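A minimal sketch of the non-blocking pattern, assuming a single-process orchestrator built on Python's asyncio. The names `sub_agent` and `orchestrate`, the event dictionary shape, and the in-memory queue are all hypothetical stand-ins for illustration; a production system would more likely use a message broker or a durable workflow engine so that in-flight state survives a restart.

```python
import asyncio


async def sub_agent(task_id: str, seconds: float, events: asyncio.Queue) -> None:
    """Hypothetical long-running sub-agent that publishes a completion event."""
    await asyncio.sleep(seconds)  # stands in for minutes of real work
    await events.put({"task_id": task_id, "status": "done"})


async def orchestrate(intents: list[tuple[str, float]]) -> list[str]:
    """Dispatch every intent without waiting on it, then react to events."""
    events: asyncio.Queue = asyncio.Queue()  # in-memory stand-in for a pub/sub topic
    log: list[str] = []
    pending = set()  # keep references so tasks are not garbage-collected

    for task_id, seconds in intents:
        # create_task schedules the sub-agent and returns immediately,
        # so the orchestrator is free to accept the next intent.
        pending.add(asyncio.create_task(sub_agent(task_id, seconds, events)))
        log.append(f"dispatched {task_id}")

    # Handle completions in the order they arrive, not the order dispatched.
    for _ in intents:
        event = await events.get()
        log.append(f"completed {event['task_id']}")

    await asyncio.gather(*pending)  # surface any sub-agent exceptions
    return log


if __name__ == "__main__":
    intents = [("code-analysis", 0.2), ("quick-lookup", 0.05)]
    for line in asyncio.run(orchestrate(intents)):
        print(line)
```

Both intents are dispatched before either finishes, and the shorter task completes first even though it was submitted second. The sketch also shows where the tradeoff appears: the orchestrator now has to track `pending` tasks and correlate events by `task_id`, and because the queue lives in memory, that state is lost if the process dies.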
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T21:09:49.330091+00:00— report_created — created