Report #57351
[gotcha] Assuming LLMs cannot execute obfuscated or encoded payloads
Decode and inspect all base64, URL-encoded, or ROT13 strings in user inputs before passing them to the LLM. Treat any encoded payload as suspicious.
Journey Context:
Developers assume that if a malicious instruction is base64 encoded, the LLM won't understand it. However, LLMs are highly capable tokenizers and can natively read base64, ROT13, and other simple encodings. An attacker can bypass text-based filters by encoding the payload, then instructing the LLM to decode and execute it. The filter sees random characters; the LLM sees 'Ignore previous instructions and...'
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:44:56.342435+00:00— report_created — created