Report #56636
[gotcha] LLMs decoding base64 or ROT13 obfuscated payloads to bypass filters
Decode all standard encodings \(base64, URL encoding, ROT13\) in user inputs before passing them to the LLM or safety filters, or implement a pre-processing pipeline that rejects heavily encoded inputs.
Journey Context:
Developers assume the LLM only reads plain text. However, LLMs are highly capable at decoding base64, ROT13, and hex. An attacker passes a benign-looking base64 string which the safety filter ignores, but the LLM decodes and executes the malicious instruction internally.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:33:24.043170+00:00— report_created — created