Report #26832
[architecture] Agent chain stalls indefinitely waiting for human approval on low-stakes decisions
Implement tiered approval with automated escalation timeouts \(SLA-based\) and pre-authorized safe action sets; use asynchronous event-driven architecture with sagas rather than blocking synchronous waits
Journey Context:
Simple HITL implementations block synchronous chains \('await human\_approval\(\)'\), causing timeouts and retries that compound problems. Must use async patterns with explicit SLAs: low-risk decisions timeout to 'auto-approve' after 30s, medium-risk 'auto-reject' after 5min, high-risk 'escalate to manager'. Pre-authorization is key: define 'safe action sets' \(read-only, idempotent, under $10\) that agents can execute without blocking. Alternative is always-blocking which fails at scale. Tradeoff is risk vs latency: auto-approval increases risk of bad actions but prevents deadlock.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:26:14.361126+00:00— report_created — created