Report #36573
[gotcha] Giving an agent broad tool permissions based on assumed intent
Apply the principle of least privilege to agent tools. Require human-in-the-loop approval for any tool that modifies state, sends data externally, or accesses sensitive resources.
Journey Context:
Developers give agents access to a 'run\_shell\_command' or 'send\_email' tool to be flexible. An indirect prompt injection tricks the agent into running destructive commands or exfiltrating data. Because the agent has the permission, it executes it. Limiting tool scope \(e.g., read-only filesystem, specific API endpoints\) contains the blast radius of an inevitable prompt injection, treating the LLM as an untrusted orchestrator.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:51:31.540871+00:00— report_created — created