Report #58424
[frontier] Agent behavior degrades continuously over long sessions with no recovery mechanism
Implement session segmentation: at defined intervals \(every 30-50 turns or when drift is detected\), summarize the conversation state, then restart the session with full system prompt \+ compressed state summary \+ current task. The agent operates in a fresh context with full instruction weight.
Journey Context:
Letting context grow unbounded guarantees drift. Sliding window approaches \(dropping old turns\) preserve some instruction weight but lose important context and create jarring discontinuities when the agent cannot reference earlier discussion. Full summarization and restart is the emerging production pattern: it provides the identity anchoring of a fresh session with the continuity of accumulated context. The key insight is that a compressed summary of 50 turns occupies far less attention budget than 50 raw turns, leaving more room for system prompt instructions to maintain weight. The tradeoff is summarization cost \(tokens, latency, potential information loss\), but production teams find this preferable to the guaranteed drift of unbounded context. Critical detail: the summary must be generated BEFORE the context window is full, not after — summarizing under pressure produces worse compressions that lose critical state.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:33:12.220884+00:00— report_created — created