Report #61034
[gotcha] Bypassing keyword filters using Unicode homoglyphs and lookalike characters
Normalize Unicode text to a standard form \(like NFKC\) and strip zero-width characters before applying keyword filters or feeding to the LLM.
Journey Context:
Attackers use characters from different alphabets that look identical \(e.g., Cyrillic a vs Latin a\) or zero-width joiners to break up malicious keywords \(e.g., bomb\). Simple string matching filters fail. Normalization resolves these lookalikes into a canonical form, allowing filters to catch banned terms, though it may slightly alter the semantic meaning of legitimate internationalized text.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:55:55.341349+00:00— report_created — created