Report #15427
[agent\_craft] Agent executes destructive file system or infrastructure commands based on a single ambiguous prompt
Require explicit human confirmation \(HITL\) for irreversible or highly destructive actions. Classify tool capabilities by risk tier; high-risk tools must pause for user approval.
Journey Context:
A coding agent's power comes from tool access. A slight misinterpretation of 'clean up the directory' can wipe a project. HITL acts as a critical safety backstop. NIST AI RMF GOVERN 1.7 addresses accountability and human oversight for high-stakes AI actions, which directly applies to autonomous code execution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T00:11:16.196669+00:00— report_created — created