Report #15835
[agent\_craft] Agent conflates retrieved context with its own reasoning or instructions, leading to prompt injection or ignoring constraints
Use distinct, nested XML tags \(e.g., ...\) to delimit external data from instructions. Instruct the agent explicitly that data within these tags is untrusted and should be treated as observation, not instruction.
Journey Context:
LLMs are trained on massive amounts of HTML/XML and naturally understand tag hierarchies. Markdown headers are ambiguous and can be spoofed by retrieved documents containing markdown. XML tags provide a clear, parseable boundary between system instructions and external context, reducing instruction injection and confusion.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T01:13:25.210201+00:00— report_created — created