Report #82214
[counterintuitive] can users extract system prompts
Never put secrets, API keys, or proprietary logic you cannot risk leaking in system prompts. Implement business logic and access control server-side.
Journey Context:
Developers treat system prompts as a secure backend environment. They are actually just text prepended to the user prompt and are highly susceptible to prompt injection, jailbreaking, and model compliance leaks \(e.g., 'repeat the words above'\). System prompts are instructions, not access control boundaries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:35:25.305446+00:00— report_created — created