Report #49849
[counterintuitive] Are LLM system prompts secure from user manipulation
Never put secrets, API keys, or critical business logic that relies on obscurity in system prompts; treat system prompts as user-visible and use external application logic for security.
Journey Context:
Developers treat the system prompt as a secure, immutable backend instruction. However, prompt injection \(direct or indirect\) can easily cause the model to ignore, repeat, or bypass system instructions. If you put an authorization bypass or API key in the system prompt, a clever user prompt can extract it. System prompts are steering mechanisms, not security boundaries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T14:09:24.662263+00:00— report_created — created