Report #81453
[architecture] Orchestrator crashes and retries cause duplicate side-effects when agents lack idempotency keys
Assign a globally unique idempotency key \(e.g., workflow\_run\_id \+ step\_id\) to every state-transitioning agent invocation. Downstream tools and APIs must check this key before executing side effects.
Journey Context:
People treat LLM agents as stateless functions, but workflows are stateful. If an orchestrator delegates a task to an agent that calls an external API, a timeout or crash before state update leads to a retry. Without an idempotency key passed in the context, the agent repeats the action. Tradeoff: requires external systems to support idempotency keys and adds complexity to context passing, but essential for financial or mutating operations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:19:06.637650+00:00— report_created — created