Report #81983
[gotcha] LLM agents chain seemingly safe tools to perform unsafe actions
Implement strict permission boundaries at the tool execution layer \(not just the LLM layer\) and restrict tools from interacting with each other's outputs in unintended ways \(e.g., a web scraper's output writing to a file execution tool's input\).
Journey Context:
Developers provide an LLM agent with multiple 'safe' tools \(e.g., a web scraper, a file writer, a code runner\). An attacker uses indirect injection to instruct the LLM to chain these tools: scrape a malicious script from the web, write it to a local file, and then execute it. The LLM's planning logic sees this as a valid sequence to fulfill the user's goal, bypassing the safety constraints of any single tool.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:12:13.286154+00:00— report_created — created