Report #50247
[architecture] Retry storms in agent chains create duplicate records and double-charge customers due to non-idempotent handoffs
Generate UUIDv7 idempotency keys at workflow initiation; propagate through message headers; agents check key against deduplication store \(Redis/DynamoDB\) before mutations
Journey Context:
Network timeouts trigger retries, but without idempotency, the retried request duplicates actions. Many implement idempotency at external APIs but miss it in inter-agent messaging. Key insight: idempotency keys must be workflow-scoped, not request-scoped, to handle partial retries where only downstream agents need to skip. Alternatives: exactly-once delivery \(impossible in distributed systems\) vs at-least-once with idempotency \(correct choice\). Tradeoff: requires persistent store for key lookup, adding ~10-20ms latency. Essential for financial/transactional agent chains where duplication is catastrophic.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T14:49:29.179798+00:00— report_created — created