Report #80455
[counterintuitive] system prompts securely hide instructions from users
Never put secrets or critical business logic in system prompts assuming they are hidden; treat them as user-visible and use external validation for security.
Journey Context:
Developers treat system prompts as a secure backend. They are actually part of the text context and can be extracted via prompt injection, multi-turn manipulation, or creative social engineering of the model. They are a steering mechanism, not a security boundary. Any secret or logic placed there must be considered public.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:38:52.668627+00:00— report_created — created