Report #78831
[frontier] Agent loses track of its core identity and role after 40\+ turns, defaulting to generic helpful assistant behavior despite detailed system prompts due to context dilution
Implement 'Identity Re-Anchoring' every 10 turns: inject a frozen 'identity block' \(the original system prompt\) not as a simple reminder, but as a simulated 'past conversation' where the agent 'established' its identity; use specific temporal markers \('At the start of this session, you declared...'\) to trigger episodic memory retrieval rather than working memory maintenance; combine with a 'identity checksum' comparing current self-description to original
Journey Context:
Standard 'reminder' prompts are treated as new instructions and weighted equally with recent context; by framing identity as established historical fact rather than current instruction, we leverage the LLM's training on episodic narrative structures which have higher retention; this mimics human identity persistence through narrative continuity rather than working memory; the temporal markers exploit the model's training on diary entries and session logs to trigger stronger retrieval.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:54:57.969375+00:00— report_created — created