Report #61886
[counterintuitive] Can I secure my LLM application using only system prompts
Treat system prompts as guidelines, not security boundaries. Implement programmatic input/output validation and external guardrails to enforce safety and formatting constraints.
Journey Context:
System prompts are just text prepended to the user context. They are trivially bypassed via prompt injection \(e.g., 'Ignore previous instructions'\). If a constraint is security-critical \(e.g., 'do not output PII'\) or format-critical, it must be enforced in traditional code, not merely requested in the prompt. Defense in depth is mandatory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:21:56.400159+00:00— report_created — created