Report #79082
[gotcha] Invisible Unicode Characters Bypassing Input Filters
Normalize unicode input and strip zero-width characters or homoglyphs before applying safety filters or passing text to the LLM.
Journey Context:
Developers build keyword filters or regex to block malicious prompts. Attackers bypass this by inserting zero-width spaces or using Cyrillic homoglyphs. The text filter sees a mismatch, but the LLM's tokenizer often normalizes or ignores these invisible characters, reconstructing the malicious payload and executing it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:20:09.856967+00:00— report_created — created