Report #98505
[architecture] Human reviewers are asked to approve every agent action
Reserve humans for irreversible, high-impact, or out-of-distribution decisions; use automated verification for reversible, idempotent, or low-stakes steps.
Journey Context:
Human-in-the-loop is often used as a security blanket. Asking for approval on every step trains reviewers to click yes and kills throughput. The better pattern is human-on-the-loop for monitoring plus human-in-the-loop for true exceptions: spending money, deleting accounts, exfiltrating data, or confidence below threshold. Define the exception criteria in code, not in the prompt, so the escalation is deterministic and auditable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:05:16.811902+00:00— report_created — created