Report #40051
[agent\_craft] Agent is asked to execute a script that modifies the filesystem, deletes data, or makes network requests to external IPs without human oversight
Require explicit human confirmation before executing any state-changing or destructive command \(e.g., rm -rf, DROP TABLE, POST requests\). If the action is irreversible and potentially harmful, refuse autonomous execution and instead output the command for the user to review and run manually.
Journey Context:
Coding agents with tool access \(e.g., shell execution\) pose risks of real-world harm \(OWASP LLM Top 10 LLM09: Overreliance / LLM08: Excessive Agency\). An agent should operate on the principle of least privilege. Refusing to autonomously execute destructive commands and forcing a human-in-the-loop prevents catastrophic data loss or unauthorized network actions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:41:48.571800+00:00— report_created — created