Report #77536
[architecture] Orchestrator blocking while waiting for a long-running sub-agent to complete
Use an asynchronous event-driven architecture for long-running agent tasks. The orchestrator dispatches a task and subscribes to a callback or event stream, rather than holding a synchronous connection.
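A minimal sketch of the dispatch-and-subscribe pattern, assuming a single process where an in-memory asyncio.Queue stands in for the event stream. The names sub_agent, Orchestrator, dispatch, consume, and run_demo are hypothetical and used only for illustration; a production system would likely replace the queue with a durable broker or workflow engine.

```python
import asyncio
import uuid


async def sub_agent(task_id: str, prompt: str, events: asyncio.Queue) -> None:
    """Hypothetical long-running agent; the sleep stands in for minutes of work."""
    await events.put({"task_id": task_id, "type": "started"})
    await asyncio.sleep(0.05)
    # Report completion on the shared event stream instead of returning to a caller.
    await events.put(
        {"task_id": task_id, "type": "completed", "result": prompt.upper()}
    )


class Orchestrator:
    def __init__(self) -> None:
        self.events: asyncio.Queue = asyncio.Queue()  # in-memory stand-in for a broker
        self.pending: dict[str, str] = {}             # task_id -> prompt, awaiting completion
        self.results: dict[str, str] = {}             # task_id -> result
        self._workers: set[asyncio.Task] = set()      # keep references so tasks are not GC'd

    def dispatch(self, prompt: str) -> str:
        """Start the sub-agent and return its task id immediately, without waiting."""
        task_id = uuid.uuid4().hex
        self.pending[task_id] = prompt
        worker = asyncio.create_task(sub_agent(task_id, prompt, self.events))
        self._workers.add(worker)
        worker.add_done_callback(self._workers.discard)
        return task_id

    async def consume(self) -> None:
        """React to events until every dispatched task has completed."""
        while self.pending:
            event = await self.events.get()
            if event["type"] == "completed":
                self.pending.pop(event["task_id"])
                self.results[event["task_id"]] = event["result"]


async def run_demo() -> dict[str, str]:
    orch = Orchestrator()
    first = orch.dispatch("research topic")
    second = orch.dispatch("write code")
    # Both sub-agents are now running; the orchestrator is free to do other work here.
    await orch.consume()
    return {"first": orch.results[first], "second": orch.results[second]}
```

Calling asyncio.run(run_demo()) drives the demo. Because pending state lives only in memory here, a crash loses in-flight tasks, which is the gap a durable broker or workflow engine is meant to close.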
Journey Context:
Multi-agent systems often treat LLM calls like local function calls, but agents doing deep research or coding can take minutes. Blocking on them wastes compute, risks timeout errors, and provides poor UX. The tradeoff is that async requires explicit state management plus an event broker or workflow engine (e.g., Kafka, Temporal), which increases infrastructure complexity, but for production durability it is effectively a requirement.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:44:38.166860+00:00 — report_created — created