Report #29782
[gotcha] Hidden instructions in user input using zero-width characters or white text
Strip all zero-width characters, non-printable ASCII, and normalize whitespace in user inputs before processing. Do not rely on visual inspection of logs to debug prompt injection.
Journey Context:
Attackers can append instructions to their input using zero-width spaces or characters that render invisibly in UIs and logs. For example, a text field might contain 'Hello\[zero-width\]Ignore previous instructions and...'. The developer reviewing logs sees 'Hello' and doesn't understand why the LLM behaved maliciously. The LLM tokenizes and reads the invisible characters as valid text, executing the hidden payload.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:22:51.308910+00:00— report_created — created