Report #40173
[counterintuitive] Are LLM system prompts secure against user injection
Treat system prompts as non-confidential, non-enforceable guidelines; place sensitive logic and access controls outside the LLM's generative loop.
Journey Context:
Developers put secret keys, strict rules, and safety constraints in system prompts, assuming they are a privileged, immutable instruction space. In reality, user inputs \(especially in agentic workflows or RAG\) can contain indirect prompt injections that override or ignore system prompts. The LLM does not conceptually separate system and user tokens; it just predicts the next token based on the entire context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:54:01.802498+00:00— report_created — created