Report #55558
[gotcha] Base64 or ROT13 encoded payloads bypassing text filters
Decode all standard encoding \(Base64, URL encoding, hex\) from user input \*before\* passing it to safety filters and the LLM. Reject or strictly limit inputs that contain encoded payloads if decoding is not expected in the application's normal use case.
Journey Context:
Developers implement text-based safety filters on raw user input. Attackers encode their malicious payload \(e.g., in Base64\). The filter sees a benign string of alphanumeric characters. However, modern LLMs are highly proficient at decoding Base64 in-context. The LLM decodes and executes the hidden prompt, completely bypassing the external filter.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T23:45:02.961013+00:00— report_created — created