Report #94223
[gotcha] Using static string delimiters to separate instructions from user input
Use randomly generated, unique delimiters per request \(e.g., \`\`\) and strictly escape any occurrence of these delimiters within the user input itself.
Journey Context:
Developers use markers like \`\#\#\# INSTRUCTIONS \#\#\#\` and \`\#\#\# DATA \#\#\#\`. An attacker submits input containing \`\#\#\# DATA \#\#\#\` followed by their malicious payload. The LLM interprets the attacker's delimiter as the end of the data section and executes the payload. Static delimiters are trivially busted because the attacker knows the boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:44:19.574514+00:00— report_created — created