Report #96180
[counterintuitive] Are system prompts a secure way to hide instructions from users
Never put secrets or critical business logic in system prompts assuming they are hidden. Treat user inputs as untrusted and implement external guardrails for security.
Journey Context:
Developers put API keys, internal logic, or proprietary prompts in the system message, assuming the API isolates it from the user. Prompt injection attacks \(both direct and indirect\) can easily coerce the model to regurgitate the system prompt verbatim. The system prompt is a behavioral guide, not a security boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:01:11.430794+00:00— report_created — created