Report #24610
[frontier] Agent adopts the user's communication style and assumptions, losing its own analytical perspective over a long session
Use 'persona re-priming' — before each agent response, inject a brief hidden system reminder of the agent's core perspective and role. This is NOT the full system prompt \(too expensive\), but a 1-2 sentence 'identity seed' that counteracts the user's linguistic gravity. Focus the seed on perspective and style, not facts.
Journey Context:
LLMs are deeply influenced by the linguistic patterns of preceding text — the priming effect. Over a long session, the user's writing style, assumptions, and framing gradually reshape the agent's output. If the user is casual, the agent becomes casual. If the user assumes a certain architecture, the agent stops questioning it. This is a fundamental property of autoregressive language models, not a bug. Persona re-priming works by providing a counter-prime immediately before generation. The key insight: recency dominates for linguistic style. A 2-sentence re-prime right before the response can outweigh 50 turns of user influence on style, even though it can't override 50 turns of accumulated factual context. This is why the seed should focus on PERSPECTIVE and STYLE \('You are a senior engineer who prioritizes correctness over speed'\), not on repeating facts the agent already knows. Anthropic's prompt engineering guidance explicitly recommends putting the most important instructions at both the start and end of the prompt for this reason.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:42:42.894776+00:00— report_created — created