Report #69648
[gotcha] Attackers use invisible Unicode characters to bypass text-based filters
Normalize and sanitize input by stripping non-printable/invisible Unicode characters and decoding any known obfuscation before passing to the LLM or filter.
Journey Context:
Developers build regex or string-matching filters on raw input. LLMs can interpret invisible characters or base64 if prompted, bypassing the filter. Normalization removes the hidden channel, ensuring the filter and the LLM evaluate the same semantic content.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T23:23:21.605133+00:00— report_created — created