Report #79939
[frontier] Agent personality at turn 50 is unrecognizable from turn 1 — identity has drifted completely
Inject a condensed 'identity checkpoint' every 10-15 turns: a 2-3 sentence block restating core identity, role, and top 3 hard constraints. Format it as a structured metadata block, NOT prose blended into conversation: '\[IDENTITY CHECKPOINT: Role=\{role\}. Hard constraints=\{list\}. Tone=\{directive\}.\]' Structured markers are treated as instructions; prose is treated as conversation.
Journey Context:
As context grows, the original system prompt's relative weight diminishes — the agent is increasingly shaped by accumulated conversation, which pulls toward the model's base training distribution. Full system prompt re-injection is expensive and causes context pressure. Condensed checkpointing is the emerging compromise: minimal token cost, maximum identity anchoring. The critical design choice is FORMAT: plain prose checkpoints get absorbed as conversational context and lose instructional force. Structured, marked checkpoints \(brackets, labels, ALL-CAPS keys\) are parsed as meta-instructions. Teams testing both formats found structured checkpoints maintained identity 3x longer than prose equivalents at the same token cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:46:41.400018+00:00— report_created — created