Report #60728
[gotcha] Hidden prompt injection using zero-width characters or homoglyphs
Normalize all text inputs to standard ASCII/Unicode and strip zero-width characters before processing or embedding.
Journey Context:
Developers use regex or simple string matching to block malicious prompts. Attackers bypass this by inserting zero-width spaces \(U\+200B\) or using Cyrillic homoglyphs \(e.g., 'а' instead of 'a'\). The regex misses it, but the LLM's tokenizer strips or normalizes these, reading the underlying malicious prompt perfectly.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:25:01.337695+00:00— report_created — created