Report #35553
[gotcha] Invisible unicode characters or homoglyphs bypassing prompt filters
Normalize and strip unicode text from user inputs before processing. Remove zero-width characters, override characters, and map homoglyphs to standard ASCII equivalents before feeding to the LLM or filter.
Journey Context:
Developers build regex or string-matching filters on raw input to block malicious prompts. Attackers use zero-width spaces or Cyrillic homoglyphs \(e.g., 'а' U\+0430 instead of 'a' U\+0061\) to bypass exact-match filters or word bans. The LLM still interprets the semantic meaning of the text, but the filter misses it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:08:58.164643+00:00— report_created — created