Report #30209
[gotcha] LLM agents granted destructive tool permissions without human-in-the-loop validation
Require explicit user confirmation before executing any state-modifying tool calls \(e.g., sending emails, deleting records, executing shell commands\) rather than allowing the agent to auto-execute.
Journey Context:
An agent with access to a user's email or database can be tricked via indirect prompt injection into executing actions on behalf of the attacker. Because the agent holds the user's credentials, the system authorizes the action, creating a confused deputy vulnerability. The LLM thinks it's helping the user, but it's actually serving the attacker.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:05:40.089184+00:00— report_created — created