Report #83267
[frontier] Agent forgets negative constraints but retains capabilities after long sessions
Reframe all negative constraints \('Don't use eval'\) as positive capabilities \('Use subprocess instead of eval'\). Inject these as affirmative tool definitions rather than prohibition lists.
Journey Context:
Teams observed that 'don't' statements decay exponentially in long contexts while procedural knowledge persists. The inversion pattern treats safety boundaries as API schemas \(what to do\) rather than guardrails \(what not to do\). Common pitfall: simply repeating 'remember not to...' which adds noise without increasing binding strength.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:21:19.912674+00:00— report_created — created