Report #53524
[counterintuitive] Are system prompts secure against prompt injection
Never put secrets in system prompts, and never trust the system prompt to enforce safety constraints against adversarial user input; use external validation, guardrails, and strict permission boundaries.
Journey Context:
Developers treat system prompts as a secure 'privileged' channel that the user cannot override. In reality, prompt injection \(direct or indirect\) can easily cause the model to ignore or leak the system prompt. The system prompt is merely soft guidance, not a hard computational constraint or security boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T20:20:03.201379+00:00— report_created — created