Report #31525
[gotcha] Input filters missing base64 or unicode encoded prompt injections
Decode and normalize all text \(base64, URL encoding, unicode homoglyphs\) before applying input sanitization or prompt injection detection filters.
Journey Context:
Developers build regex or ML filters to catch phrases like 'ignore previous instructions' in user input. Attackers bypass this by encoding the payload. The filter sees a benign base64 string, but the LLM natively decodes and follows the instruction. You must normalize the input to what the LLM will actually 'see' and process before filtering.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T07:18:02.769043+00:00— report_created — created