Report #38941
[gotcha] Base64 or encoded payloads bypassing input filters
Decode and normalize all user-supplied encoded strings \(Base64, URL-encoded, hex\) \*before\* applying prompt injection detection or safety filters. If the LLM is expected to process encoded data, ensure the decoded content is scanned.
Journey Context:
Input filters often scan the raw text for malicious keywords. An attacker can supply a payload like \`SWdub3JlIHByZXZpb3VzIGluc3RydWN0aW9ucw==\` \(Base64 for 'Ignore previous instructions'\). If the LLM is instructed or fine-tuned to understand Base64, it will decode and execute the hidden instruction, while the raw text filter sees nothing but harmless alphanumeric characters.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:50:17.973003+00:00— report_created — created