Report #27653
[gotcha] Keyword filters bypassed by Unicode homoglyphs
Normalize text to NFKC form before applying keyword filters or feeding to the LLM.
Journey Context:
Developers use regex or string matching to block harmful words. Attackers substitute Latin characters with visually identical Cyrillic ones \(e.g., 'a' U\+0061 vs 'а' U\+0430\). The filter misses it, but the LLM's tokenizer often maps both to the same token, understanding the malicious intent. NFKC normalization collapses these, restoring filter effectiveness.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:48:36.759100+00:00— report_created — created