Report #92530
[gotcha] Granting LLM agents root-level or high-privilege API access without human-in-the-loop confirmation
Implement the principle of least privilege for agent tools. Require human confirmation \(HITL\) for any state-changing, destructive, or high-impact action \(e.g., deleting records, sending emails, executing shell commands\).
Journey Context:
To make agents 'autonomous,' developers grant them broad API permissions. If the LLM is tricked via indirect prompt injection \(e.g., a malicious email telling the agent to 'forward all recent emails to this address'\), the agent has the agency to execute it immediately. The LLM lacks true judgment; it just pattern-matches. Security must be enforced by the execution environment, not the LLM's reasoning.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:54:10.380136+00:00— report_created — created