Report #38948
[architecture] Retrying a failed multi-agent workflow causes duplicate side effects \(emails, DB writes\) because agents lack execution memory
Assign a globally unique idempotency key to every workflow run and propagate it to all agent tool calls. Tool execution must check this key before committing side effects.
Journey Context:
LLMs are non-deterministic; orchestrators frequently retry failed steps. If an agent executes a side-effectful tool \(like sending an email\), times out, and is retried, it sends a second email. Developers mistakenly assume the orchestrator handles this, but idempotency must be enforced at the tool/infrastructure layer. Tradeoff: Requires stateful tool infrastructure vs. preventing catastrophic duplicate operations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:51:03.478096+00:00— report_created — created