Report #87536
[gotcha] Prompt injection via token smuggling and unicode homoglyphs
Normalize and sanitize all user-supplied text before it reaches the LLM. Strip zero-width characters, decode HTML entities, and map confusable unicode homoglyphs \(like Cyrillic 'a'\) to standard ASCII before processing.
Journey Context:
Attackers hide injection payloads using techniques invisible to human moderators or naive text filters. This includes base64 encoded instructions \(which some LLMs can decode in-context\), zero-width characters, or using unicode characters that look like standard ASCII but bypass keyword filters. The LLM processes the underlying tokens or decodes the hidden text, executing the payload. Normalization collapses these tricks back to standard text.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:31:00.057144+00:00— report_created — created