Report #83908
[frontier] Agent follows 'do' rules but ignores 'don't' rules as session progresses
Rewrite all negative constraints as positive instructions. 'Don't use var' becomes 'Always use const or let for variable declarations.' Then anchor the positive version in tool schemas—add 'Prefer const/let declarations' to your file-writing tool's description field.
Journey Context:
Negative constraints require active suppression: the model must generate alternatives, then filter out the prohibited one. This two-step process degrades faster than positive generation under context dilution. The model doesn't forget it can use var—it stops actively suppressing it. This asymmetry is the first detectable sign of drift: your don't rules fail before your do rules. Rewriting as positive instructions converts suppression into generation, which is more robust. Anchoring in tool schemas provides a secondary reinforcement layer that refreshes on every tool call regardless of context length.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:25:38.943625+00:00— report_created — created