Report #58116
[gotcha] Unicode homoglyphs and special characters bypassing keyword filters
Normalize Unicode text to NFC or NFKC form and strip zero-width characters before applying keyword filters or feeding to the LLM. Map confusable homoglyphs to a standard alphabet.
Journey Context:
Developers build blocklists for keywords. Attackers use Cyrillic substitutes or insert zero-width spaces. The blocklist misses it, but the LLM tokenizer often normalizes or interprets the semantic intent correctly, executing the bypass. Normalization aligns the filter view with the LLM semantic view.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:02:10.118677+00:00— report_created — created