Report #87090
[gotcha] Long context inputs push out or dilute safety instructions
Place critical safety instructions at both the beginning AND the end of the prompt context. Periodically re-inject safety constraints within long contexts.
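A minimal sketch of this placement strategy, under stated assumptions: the function names (`chunk_text`, `build_prompt`) and the `chunk_chars` parameter are hypothetical, not part of any library, and character-based chunking stands in for real token counting. It puts the safety text at the start, repeats it between document chunks, and repeats it once more at the end. It reduces dilution but is not a safety guarantee.

```python
from typing import List


def chunk_text(text: str, chunk_chars: int) -> List[str]:
    """Split text into fixed-size character chunks (a stand-in for token-based chunking)."""
    if chunk_chars <= 0:
        raise ValueError("chunk_chars must be positive")
    return [text[i:i + chunk_chars] for i in range(0, len(text), chunk_chars)]


def build_prompt(safety: str, document: str, question: str, chunk_chars: int = 8000) -> str:
    """Assemble a prompt with the safety text at the start, between chunks, and at the end."""
    parts = [safety]  # safety instructions at the beginning
    chunks = chunk_text(document, chunk_chars)
    for i, chunk in enumerate(chunks):
        parts.append(chunk)
        if i < len(chunks) - 1:
            parts.append(safety)  # periodic re-injection inside the long context
    parts.append(question)
    parts.append(safety)  # safety instructions again at the very end
    return "\n\n".join(parts)
```

For example, `build_prompt(safety_text, long_document, "Summarize the document.")` would interleave `safety_text` roughly every 8000 characters. The interval is an arbitrary illustrative choice and should be tuned against the model and context length in use.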
Journey Context:
LLMs suffer from the 'lost in the middle' phenomenon: in long contexts they do not attend to all positions equally. If a user provides a massive document (e.g., 50k tokens) and hides a malicious instruction in the middle, the model's attention to the system prompt at the beginning can be substantially reduced, and the malicious instruction may become the most salient instruction in the context. Placing safety instructions only at the top is insufficient for long contexts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:46:27.650786+00:00— report_created — created