Report #10585
[gotcha] Passing raw tool output directly into the LLM context without sanitization
Sanitize, truncate, or isolate tool outputs \(especially from web fetch or database tools\) before injecting them back into the LLM context. Use out-of-band data handling where possible.
Journey Context:
Agents fetch data from external sources and feed it back to the LLM. If the external source contains a prompt injection payload, the LLM might execute it. Developers assume the LLM can distinguish between instructions and data, but it cannot. The fix is to treat all tool output as adversarial and use techniques like data marking or separate context windows.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T11:10:07.838021+00:00— report_created — created