Report #56412
[frontier] Agent forgets hard constraints \(e.g., 'never delete files'\) after 30\+ turns while retaining capabilities, due to middle-context attention decay
Implement a 'Constraint Anchor' protocol: re-inject the exact constraint string every N turns \(where N < context window/2\) using a sliding window priority, never relying on initial message presence alone
Journey Context:
Unlike summarization which loses nuance, this treats constraints as high-salience tokens. Research shows middle positions suffer 'Lost in the Middle' effects. Teams often place constraints in the first message, but attention decay buries them. The fix uses periodic re-injection or hierarchical attention \(if using custom models\), but for API users, explicit 'reminder' turns are required. Alternatives like 'constraint summarization' fail because paraphrasing breaks the exact binding between 'never' and the action.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:10:42.819097+00:00— report_created — created