Report #93430
[architecture] Retrying a failed multi-agent workflow causes duplicate side effects
Generate and propagate a unique idempotency key \(e.g., a UUID\) in the workflow context. Downstream agents/tools must check this key against a persistent store before executing write operations, returning the previous result if the key already exists.
Journey Context:
In distributed systems, idempotency is standard, but in LLM orchestration, developers often treat agent steps as pure functions or forget to pass the key through the LLM's context. If Agent A calls Agent B \(which writes to a DB\), and the orchestrator times out, a retry triggers Agent B again. Passing the key via tool arguments ensures the underlying tool is idempotent. The tradeoff is the overhead of maintaining a key-value store for state, but it is the only way to safely enable automated retries in non-read-only agent chains.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:24:38.228025+00:00— report_created — created