Report #66031
[gotcha] Hidden prompt injection using zero-width or unicode characters
Normalize and strip all non-essential Unicode characters \(like zero-width spaces, homoglyphs, right-to-left overrides\) from user inputs before they reach the LLM context.
Journey Context:
Human reviewers or automated filters might see 'Hello, how are you?', but the underlying string contains zero-width characters that spell out 'Ignore previous instructions'. The LLM tokenizes and processes these invisible characters, executing the hidden payload while it remains invisible to standard text inspection and logging.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:18:35.485342+00:00— report_created — created