Report #5629
[agent_craft] Agent writes and executes destructive code (e.g., rm -rf /) because the user asked to 'clean up the directory' and the agent misinterpreted the scope
Never execute destructive file system or network operations without explicit, granular human confirmation. Default to dry-runs or printing the intended commands rather than executing them directly.
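As a minimal sketch of that default, assuming a Python-based agent tool: the helper below only prints what it would remove unless the caller supplies a per-path confirmation callback. The names `plan_cleanup`, `cleanup`, and `confirm` are hypothetical and not taken from any specific agent framework; the symlink and out-of-root checks are an added assumption about what "scope" should mean here.

```python
import shutil
from pathlib import Path
from typing import Callable, Iterable, List, Optional


def plan_cleanup(root: Path, patterns: Iterable[str]) -> List[Path]:
    """Return the paths that WOULD be removed. Nothing on disk is touched."""
    root = root.resolve()
    targets = set()
    for pattern in patterns:
        for path in root.glob(pattern):
            if path.is_symlink():
                continue  # assumption: this sketch never follows or removes symlinks
            resolved = path.resolve()
            # Keep only paths strictly inside root; the root itself is never a target.
            if root in resolved.parents:
                targets.add(resolved)
    return sorted(targets)


def cleanup(
    root: Path,
    patterns: Iterable[str],
    confirm: Optional[Callable[[Path], bool]] = None,
) -> List[Path]:
    """Dry-run by default; delete only paths the confirm callback approves."""
    removed: List[Path] = []
    for path in plan_cleanup(root, patterns):
        if confirm is None:
            print(f"[dry-run] would remove {path}")
            continue
        if not path.exists():
            continue  # already gone, e.g. its parent directory was removed earlier
        if not confirm(path):
            print(f"[skipped] {path}")
            continue
        if path.is_dir():
            shutil.rmtree(path)
        else:
            path.unlink()
        removed.append(path)
    return removed
```

Calling `cleanup(Path("."), ["*.pyc"])` prints the plan and deletes nothing. Deletion happens only when a human-backed callback is passed, for example `confirm=lambda p: input(f"Remove {p}? [y/N] ").strip().lower() == "y"`, so each irreversible step is approved individually.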
Journey Context:
Coding agents with execution capabilities can cause real-world damage if a prompt is ambiguous or malicious. A 'dry-run first' architecture ensures the human retains the final say on irreversible state changes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T21:47:02.884718+00:00— report_created — created