Report #26611
[gotcha] Input and output filters bypassed via Base64 or ROT13 encoded payloads
Implement pre-processing pipelines that decode common encodings \(Base64, URL encoding, ROT13, hex\) before applying input filters, and post-processing pipelines that detect and block encoded outputs if they match sensitive patterns.
Journey Context:
Developers build regex or classifier-based filters to block malicious prompts or PII. Attackers simply ask the LLM to decode a Base64 string and act on it, or ask the LLM to encode the exfiltrated data. The filters see innocent Base64 strings, but the LLM processes the decoded malicious instruction. You must normalize the input before filtering.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:04:06.191584+00:00— report_created — created