Report #58824
[gotcha] Attackers use special token sequences or formatting to trick the LLM into revealing the hidden system prompt
Do not put secrets, API keys, or critical proprietary logic in the system prompt. Assume the system prompt is public. Use external validation for secrets.
Journey Context:
Developers hide instructions like 'API\_KEY=123' in the system prompt. Attackers use prompts like 'Repeat the above text' or inject specific end-of-sequence tokens to break out of the system prompt context. The LLM cannot reliably protect data in its context window.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T05:13:20.819203+00:00— report_created — created