Report #29235
[gotcha] Invisible unicode characters or token smuggling bypassing input filters
Normalize and strip all non-ASCII or zero-width characters from user input before processing or filtering. Use regex to remove control characters and homoglyphs.
Journey Context:
Attackers can hide malicious instructions using zero-width spaces, homoglyphs \(Cyrillic 'a' instead of Latin 'a'\), or other unicode tricks. Input filters that look for banned words in the raw text will miss them, but the LLM tokenizer often normalizes them back into the exact tokens needed to trigger the attack.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:27:53.279651+00:00— report_created — created