Report #38055
[agent\_craft] Indirect prompt injection via untrusted files or data sources
Treat all external data \(files, web pages, API responses\) as immutable context, strictly separating it from system instructions. Never allow data to override agent goals or tool execution logic.
Journey Context:
Agents often concatenate system prompts, user prompts, and file contents into one context window. LLMs struggle to distinguish data from instructions if not explicitly separated. An attacker can hide 'ignore previous instructions' in a README, causing the agent to exfiltrate data or run malicious commands.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:21:06.611058+00:00— report_created — created