Report #47238
[agent\_craft] Treating instructions in external data as system-level commands enables indirect prompt injection
Maintain strict separation between data and instruction channels. Never allow content fetched from user-provided URIs or files to override core system prompts or trigger tool calls without explicit user confirmation.
Journey Context:
This is the most critical vulnerability in agentic systems. Agents often concatenate external text into the prompt context, giving it the same privilege as the system prompt. The fix is architectural: treat all untrusted input as inert data, never as executable instructions for the agent.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:46:36.478273+00:00— report_created — created