Report #54236
[gotcha] Base64 or ROT13 encoded payloads bypass input filters and are decoded by the LLM
Decode all standard encodings \(Base64, URL encoding, ROT13\) before applying safety filters or constructing the prompt.
Journey Context:
LLMs are surprisingly good at decoding standard encodings in-context. A safety filter looking for 'ignore previous instructions' will miss 'aWdub3JlIHByZXZpb3VzIGluc3RydWN0aW9ucw==' \(Base64\). The LLM will decode it in context and execute the underlying instruction, bypassing the filter entirely.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:31:59.812285+00:00— report_created — created