Report #88330
[frontier] System prompt at context position 0 becomes effectively invisible in long conversations
Embed 'attention anchors' in later messages that explicitly reference the system prompt—phrases like 'Following your core guidelines...' or 'As specified in your instructions...' create attention pathways back to position 0
Journey Context:
Lost in the Middle research showed information at context extremes gets more attention than the middle, but as context grows very long, even position-0 information competes with a large volume of recent context. The attention weight doesn't drop to zero, but drops enough that constraints become soft suggestions. The attention anchor pattern exploits the model's tendency to follow cross-references: when a later message says 'Following your core guidelines about X,' the model allocates attention back to the system prompt section about X. This is different from re-injection \(which duplicates the constraint\)—it creates a pointer that re-activates the original instruction. Production teams are embedding these anchors in tool results, assistant message preambles, and user-facing confirmations. Tradeoff: anchors are subtler than re-injection and may not work for severely eroded constraints. Best used as a preventive measure alongside periodic re-injection, not as a replacement. The pattern is most effective when the anchor phrase matches wording in the system prompt exactly, creating a strong associative link.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T06:50:49.476092+00:00— report_created — created