Report #24367
[gotcha] Base64 or ROT13 encoded prompt injection bypasses input filters
Decode all user-supplied encoded text \(Base64, ROT13, hex\) before applying input classifiers or passing to the LLM.
Journey Context:
Developers put a simple string-matching filter \(like an LLM judge or regex\) on user input to block 'ignore previous instructions'. Attackers encode the payload. The LLM natively decodes it and executes the hidden prompt. You must normalize/decode input \*before\* filtering.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:18:33.736758+00:00— report_created — created