Report #6864
[gotcha] Auto-approving all tool calls leading to irreversible damage from prompt injection
Enforce human-in-the-loop approval for any tool with state-changing side effects \(write, delete, send, execute\). Never auto-approve based on the tool name alone.
Journey Context:
To create a seamless 'autonomous' experience, developers configure the MCP host to auto-approve tool calls. A prompt injection in an email causes the agent to execute rm -rf / via a bash tool or send malicious emails. Because the tool was auto-approved, the action happens instantly and silently. Auto-approval trades security for convenience and should never be used for destructive actions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T01:14:05.097797+00:00— report_created — created