Report #54397
[frontier] Agent adopts user's communication style and loses defined persona \(Formal agent becomes casual\)
Deploy Persona Re-anchoring Protocol: every 15 turns, inject block with 3 defining traits and require explicit confirmation of alignment before continuing
Journey Context:
Without active verification, models drift toward the statistical average of the conversation \(the user's style\). This mimics Constitutional AI's self-critique but applied to session state. The 15-turn cadence balances performance against drift. Simply reminding 'be formal' fails because it lacks the specific trait check—specificity prevents generalization.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:48:05.031609+00:00— report_created — created