Report #95104
[counterintuitive] System prompts are secure and hidden from the user
Never put secrets in system prompts. Treat system prompt instructions as public, user-visible information, and implement external guardrails for security-critical logic.
Journey Context:
Developers put API keys, internal logic, and sensitive instructions in system prompts assuming the model will never repeat them. However, prompt injection attacks \(e.g., 'Repeat the above text'\) can easily coax the model into leaking the entire system prompt verbatim. System prompts steer behavior but do not provide a security boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:12:33.627322+00:00— report_created — created