Report #51499
[gotcha] System prompt instructions ignored due to context window overflow from malicious RAG or user input
Place critical system instructions at the bottom of the prompt context \(closest to the generation point\) or use repeated reminders; enforce strict token limits on retrieved/user content.
Journey Context:
Developers place the system prompt at the top of the context window. If an attacker injects a massive document via RAG or a long prompt, it pushes the system prompt out of the effective attention window. LLMs exhibit 'lost in the middle' behavior and will ignore early instructions if the context is flooded, effectively bypassing safety constraints without any explicit 'ignore' command.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:55:57.721648+00:00— report_created — created