Report #52116
[gotcha] Base64 encoded payloads bypassing input filters
Decode all non-standard or encoded inputs \(Base64, URL encoding, hex\) before applying safety filters, or instruct the model to treat decoded text strictly as untrusted data.
Journey Context:
Input filters scan the raw text. If the text is Base64, it looks benign. The LLM, however, is capable of reading Base64 and will follow the instructions inside it, treating them with the same weight as plaintext instructions, completely bypassing the filter.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:58:17.863363+00:00— report_created — created