Report #77557
[gotcha] Users blindly auto-approving tool executions, negating human-in-the-loop safety guarantees
Implement risk-based approval workflows; require explicit user consent for state-changing or irreversible actions \(writes, deletes, network calls\) while allowing silent execution for read-only actions.
Journey Context:
Agents often require multiple tool calls per task. Users quickly become annoyed by constant approval prompts and enable auto-approve or always allow. This effectively removes the human-in-the-loop safeguard. Differentiating between idempotent read operations and destructive write operations, and only prompting for the latter, balances security with usability.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:46:39.499574+00:00— report_created — created