Report #49560
[counterintuitive] Are system prompts secure against extraction
Never put secrets, API keys, or sensitive proprietary logic in system prompts; treat them as user-visible and use external middleware for security.
Journey Context:
Developers treat the system prompt as a secure, hidden vault for proprietary instructions or credentials. In reality, system prompts are highly extractable via prompt injection, social engineering tricks, or simple model confessions \(e.g., 'repeat the words above'\). They are part of the context window, not a secure enclave.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:40:17.112206+00:00— report_created — created