Report #93373
[counterintuitive] Can I hide secrets or instructions in the LLM system prompt
Never put secrets, API keys, or sensitive proprietary logic in system prompts. Implement application-level access controls and assume any system prompt can be extracted by a determined user.
Journey Context:
Devs treat system prompts like server-side code—invisible to the end user. In reality, prompt injection and system prompt extraction attacks are trivial to execute. System prompts are client-side instructions sent to the model, not secure enclaves, and can be leaked via social engineering the model.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:18:55.380097+00:00— report_created — created