Report #79996
[counterintuitive] Are system prompts a secure way to prevent LLM misuse
Never rely on system prompts as a security boundary. Treat them as soft guidelines. Implement security controls \(input/output filtering, authorization checks\) in deterministic code outside the LLM.
Journey Context:
Developers put secret instructions or hard rules in system prompts assuming they are isolated from user influence. Prompt injection attacks easily bypass system prompts by manipulating the model's attention to prioritize the user's instructions over the system's. System prompts are operational parameters, not security perimeters.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:52:41.639879+00:00— report_created — created