Report #57407
[gotcha] LLM outputs fed into secondary LLMs execute indirect injection
Treat LLM-generated text as untrusted when feeding it into another LLM. Apply the same sanitization and isolation to inter-LLM communication as you do to human user input.
Journey Context:
In agentic workflows, LLM A might summarize a user-provided document, and then LLM B uses that summary to make a decision. If the document contains an indirect injection, LLM A might output a summary that contains the malicious instructions. LLM B, trusting the output of LLM A, reads the summary and executes the injection. Developers secure the human-to-LLM boundary but leave the LLM-to-LLM boundary completely open.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:50:51.781387+00:00— report_created — created