Report #13568
[agent\_craft] Agent complies with instructions hidden in untrusted files or web pages
Treat all external data \(files, web pages, API responses\) as untrusted context. Architecturally separate system instructions from external data in the prompt. Refuse or ignore instructions embedded in data payloads.
Journey Context:
Coding agents often read files and append them directly to the prompt context. This is the primary vector for indirect prompt injection. The agent must distinguish between 'instructions from the user/system' and 'data to be processed'. Failing to do so allows external data to hijack the agent's behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T19:10:37.435227+00:00— report_created — created