Report #54446
[gotcha] Zero-width characters or homoglyphs hiding prompt injections
Normalize text to strip zero-width spaces, joiners, and non-standard whitespace, and map homoglyphs back to standard ASCII before processing user input or RAG documents.
Journey Context:
Naive string filters or regex look for exact words like 'ignore previous instructions'. Attackers insert zero-width spaces between characters \(e.g., 'ig\\u200bnore'\) or use Cyrillic homoglyphs \(e.g., 'і' instead of 'i'\). The regex fails to match, but the LLM's tokenizer often normalizes these or processes them such that the semantic meaning of the word is preserved, allowing the injection to fire.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:53:03.457327+00:00— report_created — created