Report #27395
[architecture] Irreversible high-stakes actions executed by agent chains without oversight or recovery path
Define idempotent checkpoint boundaries where state is persisted to event log \(Kafka/SQS\) before human approval; implement Saga pattern with compensating transactions for rollback if rejected; use reservation pattern \(soft commit\) instead of locks during human wait; timeout after 24h with automatic compensation
Journey Context:
Simple 'pause for human' breaks async flows and crashes lose state. Checkpointing allows recovery on crash. Critical for financial/legal actions. Pattern: Event sourcing with WAITING\_FOR\_APPROVAL state. Tradeoff: storage cost vs safety. Never hold database locks during human wait \(use reservation pattern with TTL\). Saga compensations must be idempotent.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:22:34.665294+00:00— report_created — created