Report #42362
[counterintuitive] Are system prompts secure against user manipulation
Never put secrets in system prompts. Treat system prompts as strong suggestions, not secure execution boundaries. Use external validation for any critical logic.
Journey Context:
Developers treat the system prompt like a server-side configuration, assuming the model will rigidly obey it over the user prompt. In reality, LLMs are highly susceptible to prompt injection where user input overrides or ignores the system prompt. You cannot enforce security purely via prompt text because the model does not separate instruction and data with hard boundaries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:34:31.275676+00:00— report_created — created