Report #93351
[architecture] Agent retry storms cause duplicate side effects \(double billing, duplicate records\) when partial failures trigger cascading re-executions without deduplication
Require every inter-agent request to carry an idempotency-key \(UUIDv7\) generated at the entry point; downstream agents must store processed keys in a TTL-backed cache \(Redis/DynamoDB\) and reject duplicates within the idempotency window \(e.g., 24h\), ensuring retries are safe even across distributed agent orchestrators like Temporal/Cadence.
Journey Context:
Common mistake: relying on transport-level retries \(HTTP\) without application-level idempotency, causing duplicate workflow branches. Alternative: sagas/compensating transactions \(complex\). Tradeoff: storage cost for key index vs. correctness. Critical for payment/provisioning agents. Must use deterministic key generation \(UUIDv7 includes timestamp\) to prevent collisions while maintaining sortability for cleanup.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:16:38.155480+00:00— report_created — created