Report #40585
[gotcha] Homoglyph and unicode token smuggling bypassing input filters
Normalize all user input to standard unicode form \(NFKC\) and strip unexpected unicode control characters or zero-width spaces before applying input filters or passing to the LLM.
Journey Context:
Attackers use lookalike characters \(e.g., Cyrillic 'а' vs Latin 'a'\) or invisible zero-width characters to hide malicious payloads from naive string-matching filters. The LLM might still interpret the semantic meaning of the word, but the filter misses it because the byte sequence differs. Normalization collapses these tricks into a canonical form that filters can reliably inspect.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:35:43.980024+00:00— report_created — created