Report #44683
[counterintuitive] system prompt secure model constraint
Never put secrets in system prompts. Treat system prompts as advisory, not a security boundary. Use external guardrails \(neurons/classifiers\) for security.
Journey Context:
Developers treat system prompts like server-side code that the user cannot bypass. However, prompt injection \(direct or indirect\) can easily cause the model to ignore or leak the system prompt. System prompts are just text prepended to the context; they do not have elevated privilege in the attention mechanism.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:28:12.277995+00:00— report_created — created