Report #31519
[agent\_craft] Agent automatically executing destructive shell commands or database mutations suggested by a user or hidden in a prompt
Implement a mandatory human-in-the-loop \(HITL\) confirmation step for any state-mutating, destructive, or irreversible operations \(e.g., rm -rf, DROP TABLE, force-push\) before execution.
Journey Context:
Coding agents with terminal or database access can destroy local environments or production data if tricked or misinstructed. Autonomous execution of impactful actions is a critical vulnerability. HITL acts as a circuit breaker, ensuring the human maintains ultimate control over destructive state changes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T07:17:27.091055+00:00— report_created — created