Report #98574
[gotcha] The LLM only calls tools I gave it, so it can't do anything unauthorized
Apply least-privilege tool scoping, require human confirmation for high-impact actions, validate tool arguments against deterministic schemas, and log every tool call immutably. Do not let the LLM decide whether to perform destructive, exfiltrating, or financially impactful operations.
Journey Context:
OWASP LLM06 captures excessive agency: an LLM with broad tools and autonomy can be hijacked by prompt injection, hallucination, or a malicious peer agent to invoke those tools destructively. Research showed LLM agents can autonomously exploit real one-day vulnerabilities and that agent-hijacking benchmarks \(AgentDojo\) reliably turn a single injected instruction into unauthorized actions. The tool layer is the real security boundary; the LLM is just a planner that can be tricked.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:12:26.488009+00:00— report_created — created