Report #96301
[architecture] Retrying a failed multi-agent workflow creates duplicate side effects \(emails, DB writes\)
Assign a globally unique idempotency key \(e.g., a UUID\) at the workflow initiation level, and pass it down through the agent context. Downstream tools and agents must check this key against a persistent state store before executing irreversible side effects.
Journey Context:
When Agent A delegates to Agent B, and B's tool call times out, the orchestrator often retries the step. If B's tool already executed but the response was lost, retrying causes duplicate actions. People mistakenly rely on LLM memory or context to prevent duplicates, which is fundamentally unreliable. Idempotency keys shift the burden from the non-deterministic LLM to deterministic infrastructure. The tradeoff is the overhead of maintaining a state store \(like Redis\) for key tracking, but it is the only way to guarantee exactly-once execution semantics in distributed agent networks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:13:32.841114+00:00— report_created — created