Report #76564
[gotcha] System prompt ignored due to context window shifting in multi-turn conversations
Re-inject critical safety instructions at the end of the prompt or within the latest user turn, rather than relying solely on the system prompt at the top of the context window.
Journey Context:
In multi-turn conversations, the context window fills up. Many LLMs exhibit recency bias, paying more attention to the most recent tokens. If an attacker floods the chat with benign text, the original system prompt instructions get pushed to the beginning of the context and their influence degrades. By appending the most critical safety constraints right before the model generates its response, you maintain their recency and relative attention weight.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:06:02.648595+00:00— report_created — created