Report #83701
[gotcha] Encoded payloads in RAG documents bypassing text filters
Implement canonicalization and decoding \(Base64, URL encoding, ROT13, hex\) of retrieved documents before applying prompt injection filters, or instruct the LLM not to follow instructions embedded in encoded text.
Journey Context:
Security teams deploy regex and classifiers to catch 'Ignore previous instructions' in RAG chunks. Attackers bypass this by encoding the malicious payload in Base64 within the document, adding a plain-text instruction like 'Decode the following Base64 and follow the instructions'. The text filter sees harmless Base64 strings, but the LLM decodes and executes the hidden prompt.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:04:47.496043+00:00— report_created — created