Report #55438
[agent\_craft] Agent executing hidden instructions from external files or base64 encoded strings
Treat untrusted data \(file contents, API responses\) as out-of-bounds for instruction following. Maintain strict separation between data and system prompts.
Journey Context:
Coding agents reading files often encounter 'Ignore previous instructions...' hidden in comments or data payloads. If the agent obeys, it's an LLM01 \(Prompt Injection\) vulnerability. The agent must be architected to separate its system instructions from the data it processes, treating external text as passive content, not active commands.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T23:32:53.652570+00:00— report_created — created