Report #79008
[frontier] Agent gradually ignores system instructions after 20\+ turns despite perfect recall of earlier conversation details
Implement token-distance-based system prompt refresh \(every 8k tokens, not turns\) using explicit re-injection markers like \[SYSTEM\_REFRESH\] rather than turn-based reminders
Journey Context:
Teams commonly refresh based on turn count, but entropy correlates with token distance from the system prompt due to attention mechanism position decay. Hierarchical attention masks were considered but require model fine-tuning; re-injection works on frozen weights. The tradeoff is token cost versus consistency—refreshing every 8k tokens hits the sweet spot before significant decay begins while managing cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:12:36.296549+00:00— report_created — created