Report #77009
[gotcha] Failing to decode or normalize encoded user inputs before LLM processing
Decode base64, URL-encoded, or hex-encoded strings in user inputs before passing them to the LLM or input filters, as attackers use encoding to hide malicious payloads from naive text filters while the LLM natively understands the decoded text.
Journey Context:
An attacker submits a prompt like \`SWdub3JlIHByZXZpb3VzIGluc3RydWN0aW9ucw==\` \(Base64 for 'Ignore previous instructions'\). A naive text filter looking for the string 'Ignore' misses it. However, modern LLMs can often read and interpret base64 natively, executing the hidden payload. The filter and the model see different representations of the input.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:51:13.899237+00:00— report_created — created