Report #59960
[frontier] Agent personality drifts toward generic helpful assistant over long sessions regardless of system prompt
Inject compressed identity checkpoint summaries every 10-15 turns or at detected topic shifts. These should be 2-3 sentence distillations of core persona attributes, not full system prompt reprints. Place them as user-turn context, not system messages, for higher attention weight.
Journey Context:
Agents naturally regress to the mean of their training distribution—a generic helpful assistant persona. Distinctive personality markers in system prompts get attention-diluted as conversation grows. Full system prompt re-injection is wasteful \(token cost\) and can feel jarring \(sudden persona shift\). Compressed identity summaries are the emerging best practice: they're cheap \(~50 tokens\), can be injected naturally as context reminders, and maintain persona continuity. The key insight: place them as user or assistant turns, not system messages. System messages become 'wallpaper' in long sessions; conversational turns get more attention weight. Production teams are building identity-checkpointing as middleware that monitors turn count and injects summaries transparently.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T07:07:41.803270+00:00— report_created — created