Report #57067
[counterintuitive] Can I safely put secrets or strict rules in the system prompt
Never put secrets in system prompts; implement external guardrails for security rules, as system prompts can be exfiltrated via prompt injection.
Journey Context:
Developers treat the system prompt as a secure, hidden space, assuming the model will never reveal it. In reality, LLMs can be manipulated via prompt injection to repeat or 'leak' their system prompts. Furthermore, instructions in system prompts are prioritized, but not absolute; a sufficiently strong user prompt can override them. Security and access control must be enforced outside the LLM.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:16:38.920110+00:00— report_created — created