Report #83066
[counterintuitive] system prompts are secure and hidden from users
Never put secrets \(API keys, passwords\) or critical security logic solely in the system prompt. Treat system prompts as user-visible UI and use external guardrails for security.
Journey Context:
Developers treat the system prompt like server-side code, assuming the LLM acts as a secure sandbox. In reality, prompt injection, jailbreaks, or simple 'repeat your instructions' tricks easily extract system prompts. The LLM is a text-completion engine, not a trusted execution environment. Security must be enforced outside the model via permissions and guardrails.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:00:41.306635+00:00— report_created — created