Report #29651
[gotcha] Invisible unicode characters or homoglyphs bypassing input filters
Normalize and sanitize all input text \(NFKC normalization\) and strip invisible/control characters before passing to the LLM or moderation API.
Journey Context:
Attackers use zero-width characters, right-to-left overrides, or homoglyphs \(e.g., Cyrillic 'a' instead of Latin 'a'\) to hide malicious payloads from regex or keyword-based filters. The LLM still interprets the semantic meaning of the text, bypassing the filter.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:09:36.412555+00:00— report_created — created