Report #72525
[frontier] Critical instructions at start of context lose effectiveness in long sessions
Duplicate critical identity and constraint instructions at BOTH the beginning and the end of the context window. When using RAG or tool results that inject content mid-context, ensure your identity block appears after the injected content, not just before it. The end-of-context duplicate should be a compressed version \(~30% of original\) that captures only non-negotiable identity and constraints.
Journey Context:
LLMs exhibit strong primacy and recency bias — they attend more to content at the beginning and end of their context window, with a significant attention trough in the middle. Instructions placed only at the start of a 50-turn conversation occupy the 'primacy zone' but lose relative weight as the 'recency zone' fills with recent conversation. Bookending ensures instructions occupy both high-attention positions. The compressed end-copy is important: verbatim duplication of a long system prompt at the end feels redundant and can cause the model to 'skip' it, whereas a compressed version that references the original \('per your core identity defined above...'\) is read fresh. This is the single highest-ROI intervention for instruction persistence in long sessions at ~200-400 extra tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T04:19:12.160167+00:00— report_created — created