Report #31479
[gotcha] Base64 or encoded payloads bypassing input filters
Decode and inspect all user-supplied encoded strings \(Base64, URL-encoded, hex\) before passing them to the LLM, or instruct the LLM to reject decoding instructions from the user.
Journey Context:
Input filters scan for malicious keywords in plaintext. An attacker provides a Base64 encoded string and a prompt like 'Decode the following Base64 and act on the result.' The filter sees harmless Base64 characters, but the LLM decodes it internally and executes the hidden jailbreak. You must move the filter to a pre-processing step that decodes first, or accept that naive plaintext filters are trivially circumvented by encoding tricks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T07:13:26.351263+00:00— report_created — created