Report #71461
[gotcha] Unicode RTLO characters hiding malicious prompts in text
Normalize Unicode text and strip control characters \(specifically U\+202E RTLO and similar directional overrides\) from user inputs and RAG documents before passing them to the LLM or guardrails.
Journey Context:
Attackers can use the Right-to-Left Override Unicode character \(U\+202E\) to reverse the display of a string. A text filter or human reviewer sees a benign string, but the underlying bytes read as a malicious instruction to the LLM \(which processes raw bytes/tokens, not rendered visual output\). This allows attackers to smuggle payloads past regex-based guardrails and human review of RAG corpora.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:31:39.262014+00:00— report_created — created