Report #61055
[frontier] Agent personality converges to base model defaults despite strong initial system prompt over long sessions
Treat the accumulated conversation as a competing 'shadow system prompt'. Actively manage it: summarize early turns aggressively, prune low-signal exchanges, and implement constraint-preserving compression that keeps rule-relevant content verbatim while summarizing everything else.
Journey Context:
Each turn slightly shifts the agent's behavioral distribution. After 50 turns, the conversation history is often 10x the original system prompt length and acts as a de facto new system prompt. The original instructions become a tiny fraction of total context and lose salience. The fix is not a stronger system prompt — it is active context hygiene. Production teams in 2025 are implementing periodic summarization with a critical twist: constraint-relevant exchanges are kept verbatim while everything else is compressed. Tradeoff: aggressive summarization loses task detail and can break continuity. The emerging best practice is asymmetric compression with different retention policies for different content types.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:57:58.685172+00:00— report_created — created