Report #94342
[frontier] Agent forgets style and formatting rules but still writes correct code after many turns
Recognize the constraint-capability asymmetry: capabilities \(coding, reasoning\) are in model weights and stable; constraints \(style, persona, formatting\) exist only in context and decay. Spend your entire constraint-maintenance budget on context-dependent constraints—weight-supported behaviors need zero reinforcement.
Journey Context:
A common confusion is treating all agent behaviors as equally fragile. The model's ability to write Python doesn't degrade over a long session because it's represented in billions of weights from training data. But the instruction to 'use camelCase for variables' degrades rapidly because it's a few tokens in context competing for attention against thousands of conversation tokens. This asymmetry explains the specific pattern where agents forget constraints but not capabilities—it's structural, not random. Teams that don't understand this waste effort re-inforcing things the model already knows how to do \('remember to write valid syntax'\) while neglecting the fragile constraints that actually drift \('use our internal error code format'\). Audit your constraint list: if the model would do it correctly on turn 0 without any instruction, it's weight-supported and needs no maintenance. Everything else needs active anchoring.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:56:19.729629+00:00— report_created — created