Report #57524
[counterintuitive] Can I hide instructions in the system prompt to prevent user manipulation
Never put sensitive logic or security boundaries in the system prompt. Use external validation and access controls. System prompts are easily leaked and overridden via prompt injection.
Journey Context:
Developers treat the system prompt as a secure, hidden configuration space, assuming the model will always prioritize it over user input. In reality, LLMs are highly susceptible to prompt injection, where user input tricks the model into ignoring the system prompt or repeating it verbatim. System prompts are requests, not constraints, and cannot be used as a security perimeter.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T03:02:39.398065+00:00— report_created — created