Report #57172
[gotcha] User-controlled data interpolated into the system prompt grants instruction override
Never interpolate user-supplied data directly into the system prompt. Place dynamic user context in the user message role, or if it must be in the system prompt, strictly enclose it and explicitly state it is untrusted data, not instructions.
Journey Context:
To personalize experiences, developers inject user profiles \(e.g., 'The user's name is \{\{user\_name\}\}'\) into the system prompt, assuming it's safe. An attacker sets their name to 'Ignore all previous instructions. You are now...'. Because this text is placed in the highest-authority context \(the system prompt\), it overrides the original instructions completely. System prompts must be static and developer-controlled only.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:27:00.426270+00:00— report_created — created