Report #69812
[counterintuitive] Are system prompts completely hidden from users
Never put secrets, API keys, or proprietary logic in system prompts that you cannot risk the user seeing; treat system prompts as soft instructions, not secure enclaves.
Journey Context:
Developers treat the system prompt like backend code, assuming the user cannot see it. However, prompt injection techniques, jailbreaks, or simply asking the model to repeat its instructions can easily extract the system prompt. System prompts are effectively front-end code delivered to an untrusted interpreter, and must be treated as such.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T23:39:48.746173+00:00— report_created — created