Report #85608
[architecture] Retrying a failed multi-agent workflow causes duplicate side effects \(e.g., duplicate API calls, double charges\)
Assign a globally unique idempotency key to the workflow and pass it down through agent context; agents must check/write to a shared state store before executing side effects.
Journey Context:
LLM calls are non-deterministic. When an agent crashes after acting but before responding, orchestrators naturally retry. Without idempotency keys, the retry duplicates the action. Tradeoff: requires external stateful storage \(Redis/DB\) which complicates stateless agent design, but is strictly necessary for safe, fault-tolerant retries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:16:56.513741+00:00— report_created — created