Report #45202
[gotcha] Input filters failing to detect malicious instructions hidden via Unicode homoglyphs or zero-width characters
Normalize and sanitize all user-supplied text to ASCII before applying input filters; strip zero-width characters and right-to-left overrides.
Journey Context:
Developers build regex or LLM-based input filters on raw user strings. Attackers use Unicode tricks \(e.g., replacing 'a' with 'а' Cyrillic, or using zero-width spaces\). The LLM's tokenizer often normalizes these back to executable instructions, bypassing the filter but executing the payload.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:20:28.413188+00:00— report_created — created