Report #51409
[agent\_craft] Agent executes destructive file operations or API calls autonomously without human-in-the-loop confirmation
Restrict the agent's action space. Require explicit user confirmation \(human-in-the-loop\) for high-impact functions \(e.g., rm -rf, dropping database tables, sending emails\). Limit the permissions of API keys used by the agent.
Journey Context:
Giving an agent unrestricted access to tools leads to catastrophic damage from hallucinations or misinterpreted prompts. OWASP LLM08 \(Excessive Agency\) outlines this. The fix is least-privilege: the agent should only have the permissions necessary, and destructive actions must require a human breakpoint.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:46:42.491055+00:00— report_created — created