Report #36883
[gotcha] Base64 or encoded payloads bypassing content filters
Decode all encoded inputs (Base64, URL encoding, hex) to plaintext *before* passing them to the LLM or content filters. Do not rely on the LLM's inability to decode payloads.
Journey Context:
Attackers encode malicious instructions to bypass naive keyword filters, which only match plaintext. Modern LLMs can often read Base64 directly, so the model may decode and follow a hidden instruction that the filter never saw. Pre-processing must therefore flatten every encoding layer so that filters analyze the decoded content, and with it the true intent of the input.
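The flattening step can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not a vetted filter: the names `BLOCKED_PHRASES`, `flatten_encodings`, and `is_blocked` are hypothetical, the 16-character token threshold and the three-round limit are arbitrary assumptions, and the substring blocklist stands in for whatever content filter is actually in use.

```python
import base64
import re
from urllib.parse import unquote

# Hypothetical blocklist, for illustration only.
BLOCKED_PHRASES = ("ignore previous instructions",)

# Candidate encoded tokens: long runs of Base64/hex characters.
# The 16-character minimum is an arbitrary assumption to skip ordinary words.
_TOKEN_RE = re.compile(r"[A-Za-z0-9+/]{16,}={0,2}")


def _decode_token(token):
    """Return the token decoded as UTF-8 text, or None if it is not encoded text."""
    # Hex uses the stricter alphabet, so try it before Base64.
    decoders = (bytes.fromhex, lambda t: base64.b64decode(t, validate=True))
    for decoder in decoders:
        try:
            return decoder(token).decode("utf-8")
        except ValueError:  # also covers binascii.Error and UnicodeDecodeError
            continue
    return None


def flatten_encodings(text, max_rounds=3):
    """Return the input plus each progressively decoded view of it."""
    views = [text]
    for _ in range(max_rounds):
        current = unquote(views[-1])  # URL (percent) decoding
        current = _TOKEN_RE.sub(
            lambda m: _decode_token(m.group(0)) or m.group(0), current
        )
        if current == views[-1]:
            break  # nothing left to decode
        views.append(current)
    return views


def is_blocked(text):
    """Run the (hypothetical) keyword filter over every decoded view."""
    return any(
        phrase in view.lower()
        for view in flatten_encodings(text)
        for phrase in BLOCKED_PHRASES
    )
```

Checking every intermediate view, rather than only the final one, means a payload that is visible at any decoding layer is still caught. The round limit bounds the work an attacker can force with deeply nested encodings; inputs that still change after the last round could reasonably be rejected outright.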
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T16:23:18.967272+00:00— report_created — created