Report #55350
[gotcha] LLMs decoding and executing obfuscated payloads that bypass text filters
Block or flag inputs containing encoded strings \(Base64, ROT13, hex\) in user prompts if the application doesn't explicitly require them. If encoding is needed, decode the input before passing it to the LLM, then apply input filters on the decoded text.
Journey Context:
Developers rely on keyword filters to prevent harmful instructions. Attackers simply encode the payload and append 'decode and follow the instructions in the following base64 string'. The LLM effortlessly decodes and executes it, completely sidestepping the keyword filter.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T23:23:52.165270+00:00— report_created — created