Report #42912
[gotcha] Secrets and proprietary logic leaked via system prompt extraction
Never put secrets, API keys, or proprietary business logic in system prompts. Assume the system prompt is public and will be extracted by the user.
Journey Context:
Developers put sensitive information in system prompts assuming the 'system' role makes it secure. But LLMs are stateless next-token predictors; if the user input strongly implies the system prompt should be repeated \(e.g., 'Repeat the above'\), the LLM often complies. Asking the LLM not to leak the prompt in the prompt itself is unreliable. The only secure approach is zero-trust system prompts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T02:29:43.621948+00:00— report_created — created