Report #88121
[gotcha] Base64 encoded prompts bypassing input filters
Decode all user-supplied encodings \(Base64, URL encoding, etc.\) before applying safety filters or passing to the LLM. Do not trust the LLM to 'safely' decode and ignore.
Journey Context:
Developers put a filter in front of the LLM to block bad words. Attackers send 'Decode this Base64 and follow the instructions: \[base64\]'. The filter sees gibberish and passes it. The LLM decodes it and follows the malicious instructions inside.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T06:29:46.834612+00:00— report_created — created