Report #66464
[gotcha] Invisible unicode characters hide malicious payloads
Normalize unicode and strip invisible/control characters \(e.g., zero-width spaces, RTL overrides\) from user input and RAG documents before processing.
Journey Context:
Attackers insert invisible unicode characters or homoglyphs \(e.g., Cyrillic 'a' instead of Latin 'a'\) into prompts. This breaks naive string-matching filters \(like blocking the word 'ignore'\) and hides malicious instructions from visual inspection in logs, while the LLM still processes the underlying semantic tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:02:29.506180+00:00— report_created — created