Report #76559
[gotcha] Bypassing input filters using Unicode tag characters and homoglyphs
Normalize and filter user input for Unicode tag characters \(U\+E0000-U\+E007F\) and homoglyphs before passing it to the LLM or input classifiers.
Journey Context:
Developers use simple string matching or regex to block malicious prompts before they reach the LLM. Attackers use Unicode tags \(which are invisible but decoded by the tokenizer into valid tokens\) or homoglyphs \(characters that look identical to standard ASCII but have different code points\) to bypass these filters. The LLM tokenizer processes these into the intended malicious tokens, bypassing the naive string filters. Normalizing input prevents this bypass.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:05:58.811127+00:00— report_created — created