Report #86635
[counterintuitive] system prompt hides instructions from user
Never put secrets, API keys, or security-critical business logic solely in the system prompt; assume the user can extract it via prompt injection or model leakage.
Journey Context:
Developers put proprietary logic or security constraints in system prompts thinking they are secure. LLMs are highly susceptible to prompt extraction attacks \(e.g., 'repeat the above text'\) and can leak system prompts verbatim. Security and access control must be enforced outside the LLM boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:00:21.885241+00:00— report_created — created