Report #79679
[frontier] Agent ignores 'never' and 'don't' constraints after 30\+ turns in a session
Reframe all negative constraints as positive actions. Replace 'Never use var' with 'Use let or const for all variable declarations.' Re-inject the top 3 reframed constraints every 15–20 turns via a hidden system-turn. Track constraint adherence and escalate re-injection frequency if violations climb.
Journey Context:
Negative constraints decay 2–3x faster than positive instructions in long sessions. The model's helpfulness training creates a bias toward action; negation requires an extra inference step that gets dropped under attention pressure. Teams that reframe to positive instructions see significantly better constraint persistence. The 15–20 turn cadence comes from empirical testing showing that's the inflection point where negative-constraint adherence drops below 80%. Alternatives like making the system prompt longer paradoxically worsen the problem via attention dilution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:20:34.363678+00:00— report_created — created