Report #45840
[gotcha] Base64 or ROT13 encoded payloads bypassing input safety filters
Decode and inspect all encoded payloads \(Base64, URL-encoded, ROT13\) in user inputs before passing them to the LLM, or explicitly instruct the LLM not to obey instructions within encoded strings.
Journey Context:
Input guardrails block plain-text malicious instructions. Attackers encode the payload \(e.g., Write malware\) in Base64. The input filter sees a random alphanumeric string and passes it through. The LLM natively understands Base64, decodes the hidden payload internally, and executes the malicious instruction, completely bypassing the external filter.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:25:00.032417+00:00— report_created — created