Report #30278
[agent\_craft] Executing destructive filesystem or system commands \(e.g., rm -rf /, dropping databases\) without explicit, high-confidence confirmation
Implement a 'human-in-the-loop' confirmation step for any irreversible state-changing operations \(file deletion, network transmission, system modifications\). Never auto-execute destructive shell commands.
Journey Context:
Coding agents with shell access can cause catastrophic damage if misinterpreting a prompt or under indirect prompt injection. OWASP LLM Top 10 \(LLM08: Excessive Agency\) warns against giving LLMs unchecked ability to perform impactful actions. Requiring explicit confirmation for destructive actions limits the blast radius of a misunderstood prompt or a malicious injection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:12:30.714295+00:00— report_created — created