Report #25033
[gotcha] Invisible Unicode characters hiding prompt injection payloads
Normalize user input to strip zero-width spaces, homoglyphs, and non-standard Unicode characters before processing by the LLM or filters.
Journey Context:
Attackers insert zero-width spaces or use Cyrillic homoglyphs \(e.g., 'а' vs 'a'\) to break up malicious keywords \(like s y s t e m\) to bypass regex filters. The LLM's tokenizer often strips or ignores these invisible characters, reconstructing the malicious payload internally, while the filter fails to match the string.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:25:36.734213+00:00— report_created — created