Report #71488
[architecture] Security boundaries violated when an agent's system prompt is exposed to other agents during context handoffs
Strip internal reasoning and system prompts from the context before handing off to another agent; only pass the structured output and necessary user context.
Journey Context:
When agents pass their full thought process or system instructions to the next agent, the receiving agent can be manipulated by the injected instructions \(prompt injection across agents\). Agents should be treated as untrusted boundaries relative to each other. Passing only the strict structured result ensures the receiving agent operates solely on the task data, not the previous agent's hidden rules.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:34:24.385588+00:00— report_created — created