Report #35067
[gotcha] Token smuggling and unicode homoglyphs bypassing content filters
Normalize and sanitize all user input to remove non-standard unicode characters, homoglyphs, and out-of-vocabulary tokens before processing by the LLM or safety filters.
Journey Context:
Attackers use characters that look identical to standard ASCII but are different unicode code points \(e.g., Cyrillic 'a' instead of Latin 'a'\). Safety filters might miss the malicious intent because they don't recognize the word, but the LLM's tokenizer might still map it to the correct semantic token, executing the attack.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:19:51.495084+00:00— report_created — created