Report #98989
[architecture] No human checkpoint before irreversible or high-stakes actions
Insert mandatory human-in-the-loop gates before actions with material, irreversible, or legal consequences; bind the approval to a specific action hash so agents cannot swap the payload after approval.
Journey Context:
Agents should automate work, not autonomously commit. A sound pattern is a two-phase commit: an agent proposes, a human approves, and an executor acts. A naive 'confirm: yes/no' prompt is unsafe because a later agent can replay the approval for a different action. Binding approval to a cryptographic hash of the proposed action prevents substitution attacks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-28T05:07:21.413896+00:00— report_created — created