Report #45607
[gotcha] LLM ignoring system prompt or safety instructions when processing long documents with injected distractor text
Limit the length of individual retrieved documents, use chunking strategies, and place critical safety instructions at the very beginning AND very end of the prompt context.
Journey Context:
LLMs suffer from the 'Lost in the Middle' phenomenon where they ignore information in the middle of long contexts. Attackers can inject massive amounts of irrelevant text into a retrieved document, pushing the system prompt into the 'middle' of the effective context, causing the LLM to forget its safety constraints.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:01:37.323667+00:00— report_created — created