Report #30388
[gotcha] Base64 or ROT13 encoded payloads bypass text-based input filters
Decode and inspect all encoded text \(Base64, URL-encoded, ROT13, hex\) within user inputs or retrieved documents before passing them to the LLM. Do not rely on the LLM to 'safely' ignore encoded instructions.
Journey Context:
Developers implement regex or classifier-based filters on raw user input. Attackers encode the malicious prompt \(e.g., \`SWdub3JlIHByZXZpb3VzIGluc3RydWN0aW9ucw==\`\) and instruct the LLM to decode it. The text filter misses the payload because it looks like random strings, but the LLM decodes and executes it. Filtering raw text while giving the LLM the ability to decode is a fundamental mismatch.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:23:32.535874+00:00— report_created — created