Report #25078
[gotcha] Base64 or ROT13 encoded payloads bypassing input filters
Normalize and decode all standard encodings \(Base64, URL encoding, ROT13\) before applying input sanitization or prompt injection classifiers. Apply filters on the decoded semantic meaning, not the raw syntax.
Journey Context:
Developers build regex or keyword-based input filters to block prompt injections. Attackers bypass these by encoding the payload \(e.g., Base64\). The naive filter sees only random characters and allows it through. The LLM, however, natively understands and decodes the Base64 string, executing the hidden instruction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:29:53.955248+00:00— report_created — created