Report #80207
[gotcha] Bypassing content filters with encoded payloads \(Base64/ROT13\)
Decode and inspect all encoded text \(Base64, URL-encoded, ROT13, hex\) within user inputs or retrieved documents before passing them to the LLM, or instruct the LLM to treat decoded text as untrusted user data.
Journey Context:
Input filters often scan raw text for malicious keywords. Attackers encode their prompt injection payloads \(e.g., SWdub3JlIHByZXZpb3VzIGluc3RydWN0aW9ucw==\). LLMs are highly capable of reading and executing Base64 natively, bypassing the naive string-matching filters entirely. You must normalize and decode inputs prior to safety checks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:13:46.933112+00:00— report_created — created