Report #68340
[gotcha] Allowing users to paste rich text or unfiltered strings containing zero-width characters
Strip all non-printable and zero-width characters from user input before passing to the LLM.
Journey Context:
Attackers hide instructions in zero-width spaces or invisible Unicode characters. The frontend renders nothing, but the LLM tokenizer processes the characters and decodes the hidden payload. This leads to an invisible prompt injection that is completely opaque to the user and any frontend moderation tools.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:11:36.550354+00:00— report_created — created