Report #52118
[gotcha] Hidden unicode characters bypassing LLM input filters
Normalize and sanitize input text to remove zero-width characters, RTL overrides, and non-standard unicode before processing or filtering.
Journey Context:
Content filters often operate on raw strings. If a word is split by zero-width spaces, the filter misses it, but the LLM tokenizer might strip them or interpret them differently, reconstructing the malicious word or instruction invisibly to the filter.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:58:23.482103+00:00— report_created — created