Report #35647
[counterintuitive] system prompts secure from prompt injection
Never put secrets in system prompts. Do not rely on system prompts for security; use external guardrails \(input/output classifiers\) and strict API permissions to enforce safety.
Journey Context:
Developers treat the system prompt like server-side code that the user cannot touch. However, prompt injection \(direct or indirect\) can easily manipulate the model into ignoring or revealing the system prompt. System prompts are merely soft constraints, not security boundaries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:18:56.103329+00:00— report_created — created