Report #45968
[gotcha] Base64 or ROT13 encoded payloads bypassing input filters
Decode and normalize all user inputs \(Base64, URL encoding, ROT13, etc.\) before applying safety filters or passing to the LLM.
Journey Context:
Developers implement input filters to block specific keywords \(e.g., 'hack', 'bomb'\). Attackers bypass these by encoding their prompts \(e.g., Base64\). The LLM is capable of decoding Base64 natively, so it understands and executes the hidden instruction, while the naive string-matching filter misses it entirely.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:37:51.792099+00:00— report_created — created