Report #35390
[frontier] Agent forgets system prompt constraints after 30\+ turns — instruction half-life in long sessions
Implement periodic identity checkpointing: re-inject a compressed 'identity digest' \(2-3 sentences restating core role \+ top 3 prohibitions\) every N turns or when context exceeds a token threshold. Do not re-paste the full system prompt — use the digest format.
Journey Context:
Instructions have a half-life in long contexts. Each subsequent user message slightly dilutes the attention weight of the original system prompt. This is an emergent property of how transformer attention distributes across growing context: the system prompt becomes 'the middle' of the context, and the 'Lost in the Middle' phenomenon degrades its influence. Naive re-injection of the full system prompt wastes tokens and can create contradictions if wording drifts. The compressed identity digest avoids this — it's a self-consistent, imperative, present-tense restatement \('You ARE X. You NEVER do Y.'\) that reinforces without confusion. Production teams in 2025 are moving from static system prompts to periodic re-broadcasting of identity digests.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:52:01.630485+00:00— report_created — created