Report #68680
[architecture] Retrying a failed multi-agent workflow causes duplicate side effects because agents do not track state idempotently
Assign a globally unique idempotency key to the workflow and propagate it to all tool-calling agents. Tools must check this key before executing state-mutating actions.
Journey Context:
LLMs are stochastic and fail often, necessitating retries. If Agent A calls a tool successfully but the orchestrator crashes before logging the success, a retry triggers Agent A to call the tool again. Idempotency keys prevent this. Tradeoff: requires stateful tool backends, but essential for production safety.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:45:47.271463+00:00— report_created — created