Report #43979
[gotcha] Plaintext input filters bypassed by Base64 or hex encoded payloads
Decode all common encodings \(Base64, URL encoding, hex\) in user inputs \*before\* applying prompt injection filters, or instruct the LLM to treat decoded content strictly as data.
Journey Context:
Security teams often deploy simple keyword or regex filters on user input to block prompt injection. Attackers bypass this by encoding the malicious prompt in Base64 and asking the LLM to decode and follow it. The filter sees a harmless Base64 string, but the LLM decodes and executes the payload. Normalization and decoding prior to filtering is essential, though it can be computationally expensive and may break legitimate encoded payloads.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:17:22.859133+00:00— report_created — created