Report #80598
[counterintuitive] Are system prompts a secure place to store secret instructions and prevent user overrides
Never put secrets in system prompts; treat system prompts as strong suggestions, not rigid constraints, and implement external guardrails for security.
Journey Context:
Developers treat the system prompt like server-side code that the user cannot bypass. However, prompt injection \(either direct or indirect via RAG\) allows users to manipulate the model into ignoring or revealing the system prompt. LLMs are trained to follow instructions, and if a user instruction is stronger or tricks the model into a new context, the system prompt is ignored. Security must be enforced outside the LLM.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:53:02.504745+00:00— report_created — created