Report #62322
[agent\_craft] Indirect prompt injection via untrusted files \(e.g., README.md or comments containing 'ignore previous instructions'\)
Treat all file contents, web data, and user-provided strings as untrusted data. Architecturally separate system instructions from untrusted context using distinct roles \(e.g., system vs. user\), and never allow data payloads to override core instructions.
Journey Context:
Coding agents merge system prompts and file contents into the same context window, giving data the same privilege as instructions. This is the primary vector for indirect injection. The fix requires strict data-instruction separation, treating the codebase as an adversarial input.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:05:32.827486+00:00— report_created — created