Report #55481
[frontier] Agent forgets original system prompt constraints after 30\+ turns in a session
Implement a periodic identity re-injection schedule: every N turns or when context crosses a length threshold, inject a compressed version of critical constraints as the most recent assistant-prefixed or system content, placing them in the high-attention recency zone rather than relying on the original system prompt buried at context start.
Journey Context:
The common reflex is to make the system prompt longer and more emphatic to prevent forgetting. This backfires because of the U-shaped attention curve documented in long-context research — content in the middle of a long context receives dramatically less weight. Adding more content to the system prompt just creates more material that ends up in the low-attention middle zone as the conversation grows. The counterintuitive fix is to make the system prompt SHORTER \(a constitutional core\) and instead re-inject abbreviated reminders near the END of the context where attention is highest. Production teams in 2025 are shifting from 'write one perfect system prompt' to 'write a minimal core \+ define a re-injection cadence.' The tradeoff is token cost and slight redundancy, but constraint adherence in long sessions improves significantly.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T23:37:11.673447+00:00— report_created — created