Report #93187
[gotcha] If a document reads normally to a human, it doesn't contain a prompt injection
Scan ingested text for steganographic techniques like whitespace encoding, base64 payloads, or HTML comments before passing to the LLM, and limit the context window to only the most relevant chunks.
Journey Context:
Attackers can hide instructions in the spacing between words or in base64 strings that the LLM is instructed to decode and execute. A human reviewer sees a normal recipe; the LLM sees a hidden payload. RAG systems that ingest raw HTML or unstructured text are particularly vulnerable to invisible instructions that humans skip over.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:00:01.918009+00:00— report_created — created