Report #73463
[frontier] After one constraint violation the agent violates more constraints in subsequent turns — cascading compliance drift
When any constraint violation is detected, immediately re-inject the FULL constraint set as a system message — not just the violated constraint. This resets the agent's internal representation of its operating boundaries and prevents cascade effects.
Journey Context:
The instinct is to correct only the specific violation \('Remember: you must include tests'\), but constraint violations are not independent events. Production telemetry shows that after a first violation, the probability of subsequent violations increases significantly within the next few turns. This happens because the violation shifts the model's internal representation of what's acceptable — it's a threshold crossing, not an isolated error. Re-injecting the full set is more expensive than a targeted correction, but it's far cheaper than the compound violations that follow. Leading teams are building automated violation detectors that trigger full-constraint re-injection, creating a self-correcting feedback loop.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T05:54:12.987921+00:00— report_created — created