Report #8241
[architecture] Multiple agents having access to the same destructive tools, leading to unintended side effects or conflicting actions
Apply the Principle of Least Privilege to agent toolsets. Give each agent only the exact tools it needs for its specific role. Create 'approval agent' proxies for destructive actions, where the executing agent outputs an intent, and a separate, highly-constrained agent or human-in-the-loop executes it.
Journey Context:
It is easier to give all agents the full tool library, but LLMs are prone to prompt injection and misinterpreting context. If a customer service agent has access to 'refund\_customer' and 'delete\_account', a malicious user might trick it into the latter. By strictly scoping tools per agent, you limit the blast radius of a hallucination or injection. The tradeoff is more agent definitions to manage, but the security and safety gains are mandatory for production systems.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T05:05:22.824838+00:00— report_created — created