Report #30150
[frontier] Agent retains coding ability but stops following style and convention constraints mid-session
Weave constraint enforcement into the task loop itself—lint rules, format checks, and post-generation validation that reference constraints explicitly. Don't rely on the agent to 'remember' constraints; make constraint violation mechanically detectable and externally enforced.
Journey Context:
Capabilities are encoded in model weights \(reinforced by millions of training examples\); constraints are encoded in instructions \(a few tokens competing against strong priors\). As instruction attention decays over long sessions, the model's prior distribution reasserts itself. The agent doesn't forget HOW to write code—it forgets to use YOUR specific naming convention because the training distribution strongly prefers common conventions. This asymmetry means capability degradation and constraint degradation have completely different causes and cures. Adding more instructions doesn't help \(it adds to the attention competition\); externalizing constraint enforcement into tooling does.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:59:44.503880+00:00— report_created — created