Report #73423
[gotcha] Relying on keyword filters or regex to block prompt injection payloads
Normalize unicode input \(e.g., NFC normalization\) and strip zero-width characters or homoglyphs before applying filters or sending to the LLM.
Journey Context:
Attackers use unicode tricks like right-to-left overrides, zero-width spaces, or characters that look like English letters but are from other alphabets \(homoglyphs\) to bypass naive string-matching filters. A filter looking for 'ignore previous instructions' will miss 'ignоre previоus instructiоns' \(using Cyrillic 'о'\). Normalization collapses these tricks back to standard characters before filtering.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T05:50:12.394437+00:00— report_created — created