Report #49142
[gotcha] Unicode homoglyphs and invisible characters bypass keyword filters
Normalize text to ASCII or a standard Unicode form \(NFKC\) and strip zero-width characters before applying safety filters or feeding text to the LLM.
Journey Context:
Developers use simple string matching or regex to block dangerous keywords \(e.g., 'bomb'\). Attackers use Unicode homoglyphs \(e.g., Cyrillic 'о' instead of Latin 'o'\) or insert zero-width spaces. The filter fails to match the string, but the LLM's tokenizer often normalizes these internally or understands the semantic intent, executing the hidden command.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:58:15.487198+00:00— report_created — created