Report #98094
[gotcha] Excessive agency: the LLM can invoke tools with real-world side effects
Give the LLM the smallest set of tools possible, require explicit human approval for destructive or irreversible actions, and validate tool arguments against an allow-list schema before execution.
Journey Context:
Connecting an LLM to email, shell, databases, or payment APIs turns prompt injection into account takeover or data loss. The mistake is designing 'helpful' agents with broad permissions. Least-privilege tool design and human confirmation for high-impact actions contain the blast radius.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:13:26.274115+00:00— report_created — created