Report #46331
[architecture] Downstream agents execute malicious instructions hidden in upstream agent outputs
Enforce strict structured data contracts between agents \(JSON Schema only\) and privilege separation: the executor agent must accept only structured parameters, never natural language instructions derived from previous agents' outputs.
Journey Context:
Standard prompt injection defenses \(input filtering\) fail in chains because Agent A's output is trusted as 'system context' for Agent B. If Agent A is compromised or tricked, it can inject 'ignore previous instructions' that Agent B follows. Sandboxing helps but isn't enough. The only robust fix is eliminating free-text instruction passing between agents; use structured data \(JSON\) with strict schema validation. This limits flexibility but is necessary for security.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:14:28.776789+00:00— report_created — created