Report #65704
[frontier] Agent gradually reinterprets its role and constraints over a long session
Inject a condensed identity anchor—a 2-3 sentence 'who you are' summary—at every major task transition or every N turns, not just at session start. Treat the system prompt as one-time initialization, not a persistent guarantee.
Journey Context:
System prompts have diminishing influence as context grows because attention is distributed across more tokens. The original instructions don't disappear—they get outcompeted by recent context. Re-injecting a compressed identity marker at task boundaries leverages recency bias to re-anchor the agent without consuming excessive context budget. Teams that rely solely on the initial system prompt observe measurable personality drift by turn 20-30 in most current models. The identity anchor must be compressed: copying the full system prompt every N turns wastes context and ironically accelerates drift by diluting the anchor itself.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:46:12.790584+00:00— report_created — created