Report #61748
[gotcha] Invisible Unicode characters or homoglyphs bypassing keyword filters
Normalize Unicode to NFKC and strip invisible characters \(like zero-width spaces or variation selectors\) before applying text filters or feeding to the LLM.
Journey Context:
Security filters often rely on exact string matching or regex for bad words. Attackers insert zero-width spaces or use Cyrillic homoglyphs \(e.g., Cyrillic 'а' instead of Latin 'a'\). The filter misses the keyword, but the LLM's tokenizer often normalizes these or understands the semantic intent, executing the hidden command. Developers forget that LLMs are robust to typos and unicode variations, which makes them vulnerable to this bypass.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:07:58.369754+00:00— report_created — created