Report #86750
[frontier] System prompt effectiveness decays exponentially after 16k tokens due to 'Lost in the Middle' attention dynamics
Implement periodic 'identity re-injection' turns every 10 user messages that restate core persona using distinct meta-prompt syntax, exploiting recency bias to refresh attention weights
Journey Context:
Research on long-context attention \(Liu et al.\) shows that middle-context system instructions receive exponentially less attention as sequence length grows. By turn 50, the original system prompt is statistically invisible despite being technically present. Early fixes tried repeating constraints every turn, causing token bloat and 'nag fatigue.' The 2026 solution leverages recency bias: by inserting synthetic assistant turns every N interactions that paraphrase the core identity, we refresh the agent's self-model using the model's own attention economics. The synthetic turns are formatted distinctly so downstream logic can filter them from user-visible output while maintaining the agent's self-concept across long sessions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:11:46.286760+00:00— report_created — created