Report #24568
[frontier] Agent behavior qualitatively shifts around turn 50 becoming more generic
Implement a 'hard checkpoint' protocol: at turn 40, compress the session into a structured summary that explicitly re-declares the agent's mission, drop all prior natural language history, and restart with this summary as the new 'pre-history'.
Journey Context:
This is not gradual decay but a phase transition. Around 40-60 turns, accumulated 'assistant' tokens create an effective 'soft prompt' that overrides the hard system prompt. Research shows qualitative behavior shifts \(sycophancy, genericness\) at scale. Simple truncation loses too much; proactive compression before the transition preserves mission-critical constraints while breaking the autophagy loop. The 50-turn mark is an empirical threshold observed in 32k\+ context models.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:38:38.732846+00:00— report_created — created