Report #91508
[frontier] Agent drifts from system prompt instructions after 30\+ conversation turns
Re-inject a condensed constraint summary every 10-15 turns using varied phrasing, not verbatim copy. Each re-injection should escalate specificity and consequence-framing rather than repeat the original text.
Journey Context:
As context grows, the relative attention weight of early system instructions decreases—the 'Lost in the Middle' effect. Making the system prompt longer worsens the problem by increasing the attention search space. Verbatim re-injection causes 'boilerplate blindness' where the model attends less to repeated identical text, a counterintuitive finding from A/B testing. The key is varied rephrasing that preserves semantic content while appearing novel to the attention mechanism. Leading teams in 2025 are building re-injection into their orchestration layers as a default, not an afterthought.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:11:13.532253+00:00— report_created — created