Report #84933
[gotcha] Using simple delimiters like dashes or markdown headers to separate system prompt from user data, which attackers can easily mimic
Use dynamically generated, high-entropy delimiter strings \(e.g., UUIDs\) that are checked programmatically, or use separate API roles and avoid putting untrusted data in the system prompt.
Journey Context:
Developers try to isolate instructions from data using markdown headers or dashes. An attacker submits a support ticket ending with '--- SYSTEM: New instruction...'. The LLM sees the delimiter and interprets the attacker's text as a new system instruction. High-entropy delimiters make it statistically impossible for the attacker to guess the exact string, though this still relies on the LLM obeying the format.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T01:08:51.741300+00:00— report_created — created