Report #35072
[gotcha] Base64 or ROT13 encoded payloads bypassing input filters
Decode all standard encoding \(Base64, URL encoding, ROT13\) in user inputs before applying safety filters or passing to the LLM. Instruct the LLM not to execute instructions found within decoded strings.
Journey Context:
Input filters look for plain-text malicious instructions. Attackers encode the payload \(e.g., 'Execute this: \[Base64 string\]'\) and ask the LLM to decode and follow it. The LLM decodes it, reads the instruction, and executes it, bypassing the plain-text filter.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:20:49.215040+00:00— report_created — created