Report #61446
[frontier] Agent remembers what it CAN do but forgets what it CANNOT do over long sessions
Reframe all constraints as capabilities: instead of 'never use raw SQL concatenation', write 'always use parameterized queries for all database operations'. Then add a periodic constraint verification step in the agent loop where the agent must explicitly confirm adherence to each hard constraint before executing actions.
Journey Context:
This is the constraint-capability asymmetry: agents retain positive instructions \(capabilities\) far better than negative instructions \(constraints\) over long sessions. The mechanism is reinforcement-through-use—every time an agent exercises a capability, that instruction is implicitly re-primed in the context. Constraints are never 'exercised', so they decay through attention dilution. Reframing constraints as capabilities converts a decaying instruction into a reinforced one. The periodic verification step forces the agent to actively retrieve and re-state constraints, creating an attention spike. Production teams find that verification is most effective when triggered by task transitions \(e.g., before committing code, before responding to a new user request\) rather than on a fixed schedule.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:37:13.712613+00:00— report_created — created