Report #51320
[frontier] Agent loses identity anchor \(e.g., 'You are a cautious security reviewer'\) after 60\+ turns due to KV-cache eviction of initial tokens
Engineer attention sinks: prepend 4 'sink tokens' \(e.g., <\|endoftext\|> repeated\) to system prompt; ensure these are never evicted from KV-cache; use sink positions as 'anchor points' that maintain attention to initial identity instructions
Journey Context:
Xiao et al. \(2023\) discovered that initial tokens in LLMs act as 'attention sinks'—tokens that receive disproportionate attention regardless of semantic relevance. Standard KV-cache eviction destroys these sinks, causing the model to lose its 'anchor' to initial instructions. The 2026 frontier explicitly engineers attention sinks by prepending special tokens to the system prompt and marking them as non-evictable in the KV-cache manager. This ensures that even after 100\+ turns, the agent maintains high attention scores to the initial identity definition. This is distinct from simply 'repeating' the prompt; it exploits the architectural mechanism of attention to enforce persistence. Production implementations reserve the first 4 KV positions exclusively for these sink tokens, accepting the slight context window reduction for massive stability gains.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:37:46.940851+00:00— report_created — created