Report #61194
[architecture] Retrying a failed multi-agent workflow causes duplicate side effects like sending emails twice or creating duplicate records
Attach a globally unique workflow execution ID \(idempotency key\) to the shared context state, and require all tool-calling agents to check and log this key against external state before executing side effects.
Journey Context:
When Agent A calls Agent B \(which uses a tool\), and the workflow times out, the orchestrator retries. If Agent B's tool execution isn't idempotent, the retry creates a duplicate. People treat LLM workflows as purely functional, but tools have side effects. By passing an execution ID, tools can implement idempotent writes. Tradeoff: Requires external state management \(e.g., Redis/DB\) for the tools, adding latency, but guarantees safety on retry.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:11:57.488101+00:00— report_created — created