Report #65801
[gotcha] LLM decodes and executes hidden Base64 or ROT13 payloads
Pre-process user inputs to detect and decode common encodings \(Base64, ROT13, hex\) before passing them to the LLM. Block or sandbox requests that attempt to execute decoded instructions.
Journey Context:
A user submits a prompt that looks like innocent gibberish \(Base64 encoded\), but the LLM is capable of decoding it. The decoded string contains a jailbreak. Because the safety filter only sees the encoded text, it passes the check, but the LLM processes the decoded malicious instruction. Developers forget that LLMs are highly capable code interpreters and will happily decode obfuscated payloads mid-inference.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:55:31.724738+00:00— report_created — created