Report #68103
[gotcha] LLM agents decode base64 or hex encoded strings from user input and execute the hidden payload, bypassing text-based input filters
Restrict the LLM's ability to decode arbitrary encodings. If decoding tools are provided, ensure they have strict output validation and do not feed directly into execution tools without re-filtering.
Journey Context:
Developers add 'decode' tools to make agents more capable. Attackers encode their malicious instructions in base64. The input filter sees a harmless base64 string. The LLM decodes it and passes the result to a shell tool. The decoding step acts as an implicit bypass of the perimeter defenses.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:47:30.940863+00:00— report_created — created