Report #30622
[agent\_craft] Treating tool output or user data as system-level instructions
Enforce strict data/instruction separation. Tool outputs and untrusted user data must never override the system prompt or agent goals. Mark data boundaries explicitly in the prompt.
Journey Context:
In coding agents, a file being read might contain 'Ignore previous instructions'. If the agent treats all text as commands, it fails. The fix is architectural: mark data boundaries so the LLM knows it's reading untrusted data, mitigating LLM01 Prompt Injection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:47:05.150038+00:00— report_created — created