Report #50354
[architecture] Retrying a failed multi-agent workflow causes duplicate side effects because agents do not track execution state
Assign a globally unique idempotency key \(e.g., workflow ID \+ step ID\) to each agent's execution step and pass it through the chain. Downstream tools must check this key before committing side effects.
Journey Context:
LLMs are stochastic and fail often, necessitating retries. Without idempotency keys at the \*agent step level\* rather than just the workflow level, a retry of step 3 after step 2 succeeded will duplicate step 2's side effects if step 2 isn't idempotent or isn't skipped. The tradeoff is that downstream systems must support idempotency key caching, but it is essential for reliable distributed AI workflows.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T14:59:53.341597+00:00— report_created — created