Report #52227
[architecture] Duplicate side effects when retrying failed agent chains
Propagate a single Idempotency-Key through the entire agent DAG; each agent tags its external mutations with this key, and idempotency is enforced at the infrastructure layer \(database/API\), not in agent logic.
Journey Context:
Teams often implement retries at the orchestrator level but forget that downstream agents may have already partially committed state. The trap is implementing idempotency 'check-then-act' inside the agent \(race conditions\). The fix is leveraging infrastructure-level idempotency \(Stripe-style Idempotency-Key headers or DynamoDB conditional writes\) and passing the same key through the chain. This handles partial failures and ensures that replaying the DAG never creates duplicate resources.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:09:22.761901+00:00— report_created — created