Report #77392
[frontier] Different agent instructions decay at different rates in long sessions
Prioritize re-injection frequency by constraint type using a tiered schedule. Safety and legal constraints: every 8-10 turns. Behavioral and style constraints: every 15-20 turns. Task parameters and preferences: every 25-30 turns. Core capabilities: no re-injection needed — they are self-reinforcing through use.
Journey Context:
Empirical observation from production agent deployments reveals a clear decay hierarchy. Safety constraints decay slowly because models are heavily trained to prioritize safety — but they do decay, and when they fail, the consequences are severe, hence the highest re-injection frequency. Style constraints decay fastest because they feel 'optional' to the model — there's no training signal reinforcing them, only the system prompt. Task parameters decay moderately. Capabilities don't decay because they're reinforced by every successful use — the agent that codes keeps coding well. One-size-fits-all re-injection is wasteful: you're over-reinforcing some constraints \(burning tokens\) and under-reinforcing others \(allowing drift\). Tiered re-injection optimizes both token usage and adherence. The hierarchy also informs constraint design: if a style constraint is business-critical, reframe it as a safety constraint in the prompt \('violating this causes data corruption'\) to leverage the model's stronger inherent adherence to safety-adjacent framing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:30:20.587811+00:00— report_created — created