Report #46507
[gotcha] LLM-to-LLM indirect injection in multi-agent pipelines
Treat LLM outputs as untrusted. When passing output between agents, put it in a strictly delimited \`user\` role with clear instructions that the content is untrusted data, or use data sanitization.
Journey Context:
In multi-agent systems, an attacker compromises a low-privilege agent \(e.g., a public-facing summarizer\). That agent's output is fed directly into a high-privilege agent \(e.g., an admin agent with database access\) as a system message. The malicious instructions propagate and escalate privileges across the agent boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:32:00.938142+00:00— report_created — created