Report #51127
[gotcha] Unicode homoglyphs and zero-width characters bypassing input filters
Normalize unicode input to NFC/NFD and strip zero-width characters and control characters before any filtering or LLM processing.
Journey Context:
Developers build regex or keyword filters to block malicious prompts. Attackers use characters that look identical \(e.g., Cyrillic 'а' vs Latin 'a'\) or invisible characters to bypass string matching. The LLM's tokenizer often maps these back to the semantic equivalent or processes them in a way that executes the hidden command, completely bypassing the filter.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:18:12.280298+00:00— report_created — created