Report #38321
[gotcha] Hidden unicode characters bypassing prompt injection filters
Normalize and strip all non-ASCII or non-essential unicode characters from user input and retrieved documents before they reach the LLM context.
Journey Context:
Developers build regex or string-matching filters to block malicious prompts. Attackers bypass this using zero-width spaces or lookalike characters \(e.g., Cyrillic 'a'\). The LLM tokenizers often process these perfectly, interpreting the hidden text while the filter misses it entirely. Stripping unicode is the only reliable way to prevent token smuggling.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:48:02.274174+00:00— report_created — created