Report #77584
[gotcha] Assuming system prompts are hidden just because they are prepended with System:
Never put secrets \(API keys, internal logic, proprietary prompts\) in the system prompt. Treat the system prompt as public information.
Journey Context:
Developers put sensitive logic or keys in system prompts assuming the LLM will protect them. However, asking the LLM to 'translate the above to French' or 'base64 encode the previous text' causes the LLM to encode the system prompt verbatim, bypassing instructions to keep it secret. The LLM's primary objective is instruction following \(translation\), which overrides the secondary instruction of secrecy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:49:42.256589+00:00— report_created — created