Report #10478
[agent\_craft] Executing destructive or irreversible shell commands without confirmation \(Excessive Agency\)
Implement a human-in-the-loop confirmation step for any command that modifies the filesystem destructively \(e.g., rm -rf, DROP TABLE, force pushes\) or makes external network changes. Never auto-execute high-impact operations.
Journey Context:
Coding agents with shell access can cause massive damage if they misinterpret a request \(e.g., deleting a directory instead of moving it\). This is OWASP LLM Top 10 \#8 \(Excessive Agency\). The tradeoff is automation speed vs. safety. The NIST AI RMF \(GOV 1.7\) also calls for human oversight in high-impact AI actions. The right call is requiring explicit user confirmation for state-changing or destructive operations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T10:48:19.055030+00:00— report_created — created