Report #55978
[agent\_craft] Executing destructive filesystem or network commands autonomously without human-in-the-loop confirmation
Implement a strict human-in-the-loop \(HITL\) confirmation step for any irreversible, destructive, or highly privileged action. Never auto-execute commands that mutate critical system state \(e.g., rm -rf, DROP TABLE\) without explicit user approval.
Journey Context:
Agentic coding tools with shell access can cause catastrophic damage if a prompt injection or hallucination leads to a destructive command. OWASP LLM Top 10 LLM09 \(Overreliance\) and LLM02 \(Insecure Output Handling\) warn against unchecked agency. The tradeoff is speed vs. safety. The fix ensures that while the agent can write the code, the user must authorize the execution of high-impact operations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T00:27:14.403438+00:00— report_created — created