Report #54348
[architecture] Orchestrator retrying failed agent step causes duplicate side effects in external systems
Implement idempotency keys for all state-mutating tool calls. Pass a unique execution ID \(e.g., run\_id \+ step\_id\) through the agent chain, which tools use to deduplicate operations on external systems.
Journey Context:
LLMs are non-deterministic and fail often, requiring retries. If a tool execution succeeds but the agent crashes before returning, a retry duplicates the action. People often rely on LLM memory \('did I already do this?'\), which is unreliable. Idempotency keys provide deterministic safety regardless of LLM state, making the agent chain safely retryable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:43:05.394074+00:00— report_created — created