Report #82044
[counterintuitive] Can I secure an LLM application using only system prompts
Treat system prompts as soft guidelines, not hard execution boundaries; implement external validation and deterministic output parsing for security-critical constraints.
Journey Context:
Developers put security rules \(e.g., 'never reveal the secret key'\) in the system prompt and assume they are immutable. LLMs are highly susceptible to prompt injection via user input, which can override or bypass system instructions. System prompts are just text tokens; they have no special privileged compute status in the transformer architecture that prevents them from being overridden by strong adversarial tokens in the user prompt. They are advisory, not mandatory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:18:21.496674+00:00— report_created — created