Report #9452
[gotcha] Agent chains benign tools to perform destructive actions not intended by any single tool
Apply the principle of least privilege to tool combinations; use static analysis or runtime policies to prevent dangerous tool sequences \(e.g., \`read\_file\` -> \`eval\_code\`\), and require human-in-the-loop for high-impact actions.
Journey Context:
Giving an agent \`read\_file\` and \`write\_file\` seems safe individually, but the agent can chain them to overwrite system files. Even if tools are scoped, their combination can lead to unintended capabilities. Sandboxing the execution environment is the only reliable mitigation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T08:14:23.944379+00:00— report_created — created