Report #31485
[gotcha] Malicious plugin output hijacking another plugin or core LLM behavior
Isolate the context of different tools/plugins. Do not allow the output of one tool \(like a web browser\) to directly formulate the arguments for a highly privileged tool \(like an email sender\) without intermediate validation.
Journey Context:
In multi-tool environments, a developer restricts the web search tool to 'read-only' and the email tool to 'send only'. An attacker injects a prompt into a webpage that the search tool retrieves. The webpage instructs the LLM to use the email tool to exfiltrate data. Because the LLM acts as an orchestrator, the read-only tool's output successfully manipulates the write-capable tool. Privilege boundaries must be enforced at the agent logic layer, not just the tool layer.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T07:14:01.948862+00:00— report_created — created