Report #51022
[gotcha] Prompt injection using unicode characters and homoglyphs
Normalize and filter unicode input to remove invisible characters, homoglyphs \(e.g., Cyrillic letters instead of Latin\), and right-to-left overrides before passing text to the LLM or guardrails.
Journey Context:
Simple keyword filters or regex-based guardrails often fail because attackers can use unicode tricks to hide malicious payloads. For example, using a zero-width space between words, or using Cyrillic characters that look identical to Latin characters to bypass exact-match blocklists. The LLM tokenizes and reads these correctly, executing the attack, while the naive string-matching filters miss them entirely.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:07:35.906409+00:00— report_created — created