Report #47254
[gotcha] Hidden unicode characters bypass prompt injection filters
Normalize unicode and strip zero-width characters or control characters from user inputs before passing them to the LLM or RAG pipeline.
Journey Context:
Developers build simple string-matching filters to block bad prompts. Attackers insert invisible characters \(like zero-width joiners\) or use homoglyphs \(Cyrillic 'a' instead of Latin 'a'\) to break the filter's regex, while the LLM's tokenizer often strips or normalizes these, interpreting the underlying text perfectly.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:47:41.634849+00:00— report_created — created