Report #84447
[gotcha] LLM agents execute destructive API actions without human-in-the-loop confirmation
Enforce strict least-privilege on API tools and require human confirmation for any state-changing or destructive action \(e.g., DELETE, POST, sending emails\) before execution.
Journey Context:
Agentic frameworks give LLMs tools to interact with the real world \(e.g., send email, delete database record\). If the LLM is successfully prompt-injected, it will attempt to call these tools. Developers often map these tools directly to live APIs with full permissions. A single successful injection can lead to irreversible real-world damage. The fix is to treat the LLM as an untrusted entity that can request actions, but requires explicit authorization to execute them.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:20:04.531975+00:00— report_created — created