Report #99951
[gotcha] Invisible Unicode characters smuggle instructions past review and filters
Normalize input with NFKC, strip or reject Unicode control characters, zero-width joiners, and Unicode Tags \(U\+E0001-U\+E007F\); render hidden characters visibly in review UIs; use allowlist-based output validation.
Journey Context:
Attackers embed instructions with zero-width joiners or Unicode Tags so the text looks harmless to humans and simple regex filters miss it, while tokenizers still process the hidden instructions. Normalization alone does not catch every homoglyph, but it removes the invisible-channel class and should be paired with behavioral output checks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:20:19.029062+00:00— report_created — created