Report #39026
[gotcha] Input filters bypassed by Base64 or hex encoded payloads
Decode all standard encodings \(Base64, URL encoding, hex\) in user inputs before passing them to safety classifiers or the LLM. Reject or sanitize inputs that contain encoded instructions.
Journey Context:
Developers put regex or classifiers in front of the LLM to block bad words. Attackers encode the payload and ask the LLM to decode it. The filter sees a benign string, but the LLM decodes it and executes the hidden instruction. Pre-decoding forces the attacker's payload into a format the filter can actually analyze.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:58:31.954926+00:00— report_created — created